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Abstract. After a brief flirtation with logicism in 1917-1920, David Hilbert proposed his 
own program in the foundations of mathematics in 1920 and developed it, in concert with 
collaborators such as Paul Bemays and Wilhelm Ackermann, throughout the 1920s. The two 
technical pillars of the project were the development of axiomatic systems for ever stronger 
and more comprehensive areas of mathematics and finitistic proofs of consistency of these 
systems. Early advances in these areas were made by Hilbert (and Bernays) in a series of 
lecture courses at the University of Gottingen between 1917 and 1923, and notably in Ack- 
ermann's dissertation of 1924. The main innovation was the invention of the e-calculus, on 
which Hilbert's axiom systems were based, and the development of the e-substitution method 
as a basis for consistency proofs. The paper traces the development of the "simultaneous 
development of logic and mathematics" through the £-notation and provides an analysis of 
Ackermann's consistency proofs for primitive recursive arithmetic and for the first compre- 
hensive mathematical system, the latter using the substitution method. It is striking that these 
proofs use transfinite induction not dissimilar to that used in Gentzen's later consistency 
proof as well as non-primitive recursive definitions, and that these methods were accepted 
as finitistic at the time. 



1. Introduction 

Hilbert first presented his philosophical ideas based on the axiomatic method 
and consistency proofs in the years 1904 and 1905, following his exchange 
with Frege on the nature of axiomatic systems and the publication of Russell's 
Paradox. In the text of Hilbert's address to the International Congress of 
Mathematicians in Heidelberg, we read: 

Arithmetic is often considered to be part of logic, and the traditional 
fundamental logical notions are usually presupposed when it is a question 
of establishing a foundation of arithmetic. If we observe attentively, 
however, we realize that in the traditional exposition of the laws of logic 
certain fundamental arithmetic notions are already used, for example, 
the notion of set and, to some extent, also that of number. Thus we 
find ourselves turning in a circle, and that is why a partly simultaneous 
development of the laws of logic and arithmetic is required if paradoxes 
are to be avoided. ' 

When Hilbert returned to his foundational work with full force in 1917, he 
seems at first to have been impressed with Russell's and Whiteheads's work 
in the Principia, which succeeded in developing large parts of mathematics 
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without using sets. By 1920, however, Hilbert returned to his earUer conviction 
that a reduction of mathematics to logic is not Ukely to succeed. Instead, he 
takes Zermelo's axiomatic set theory as a suitable framework for developing 
mathematics. He locahzes the failure of Russell's logicism in its inabihty to 
provide the existence results necessary for analysis: 

The axiomatic method used by Zermelo is unimpeachable and indispensable. 
The question whether the axioms include a contradiction, however, remains 
open. Furthermore the question poses itself if and in how far this axiom 
system can be deduced from logic. [. . . T]he attempt to reduce set theory 
to logic seems promising because sets, which are the objects of Zermelo's 
axiomatics, are closely related to the predicates of logic. Specifically, 
sets can be reduced to predicates. 

This idea is the starting point for Frege's, Russell's, and Weyl's investigations 
into the foundations of mathematics.^ 

The logicist project runs into a difficulty when, given a second-order predicate 
S to which a set of sets is reduced, we want to know that there is a predicate to 
which the union of the sets reduces. This predicate would be {3P) {P{x)&S{P)) — 
X is in the union of the sets in 5 if there is a set P of which x is a member and 
which is a member of S. 

We have to ask ourselves, what "there is a predicate P" is supposed to 
mean. In axiomatic set theory "there is" always refers to a basic domain 
03. In logic we could also think of the predicates comprising a domain, 
but this domain of predicates cannot be seen as something given at the 
outset, but the predicates must be formed through logical operations, 
and the rules of construction determine the domain of predicates only 
afterwards. 

From this we see that in the rules of logical construction of predicates 
reference to the domain of predicates cannot be allowed. For otherwise 
a circulus vitiosus would result.^ 

Here Hilbert is echoing the predicativist worries of Poincare and Weyl. However, 
Hilbert rejects Weyl's answer to the problem, viz., restricting mathematics to 
predicatively acceptable constructions and inferences, as unacceptable in that 
it amounts to "a return to the prohibition policies of Kronecker." Russell's 
proposed solution, on the other hand, amounts to giving up the aim of reduction 
to logic: 

Russell starts with the idea that it suffices to replace the predicate needed 
for the definition of the union set by one that is extensionally equivalent, 
and which is not open to the same objections. He is unable, however, 
to exhibit such a predicate, but sees it as obvious that such a predicate 
exists. It is in this sense that he postulates the "axiom of reducibility," 
which states approximately the following: "For each predicate, which is 
formed by referring (once or multiple times) to the domain of predicates. 
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there is an extensionally equivalent predicate, which does not make such 
reference. 

With this, however, Russell returns from constructive logic to the 
axiomatic standpoint. [. . . ] 

The aim of reducing set theory, and with it the usual methods of 
analysis, to logic, has not been achieved today and maybe caimot be 
achieved at all."* 

With this, Hilbert rejects the logicist position as failed. At the same time, he 
rejects the restrictive positions of Brower, Weyl, and Kronecker. The axiomatic 
method provides a framework which can accommodate the positive contributions 
of Brouwer and Weyl, without destroying mathematics through a Kroneckerian 
"politics of prohibitions." For Hilbert, the unfettered progress of mathematics, 
and science in general, is a prime concern. This is a position that Hilbert 
had already stressed in his lectures before the 1900 and 1904 International 
Congresses of Mathematics, and which is again of paramount importance for 
him with the conversion of Weyl to Brouwer's intuitionism. 

Naturally, the greater freedom comes with a price attached: the axiomatic 
method, in contrast to a foundation based on logical principles alone, does 
not itself guarantee consistency. Thus, a proof of consistency is needed. 

2. Early Consistency Proofs 

Ever since his work on geometry in the 1890s, Hilbert had an interest in 
consistency proofs. The approaches he used prior to the foundational program 
of the 1920s were almost always relative consistency proofs. Various axiomatic 
systems, from geometry to physics, were shown to be consistent by giving 
arithmetical (in a broad sense, including arithmetic of the reals) interpretations 
for these systems, with one exception — a prototype of a finitistic consistency 
proof for a weak arithmetical system in Hilbert (1905). This was Hilbert's 
first attempt at a "direct" consistency proof for arithmetic, i.e., one not based 
on a reduction to another system, which he had posed as the second of his 
famous list of problems (Hilbert, 1900). 

When Hilbert once again started working on foundational issues following 
the war, the first order of business was a formulation of logic. This was 
accomplished in collaboration with Bernays between 1917 and 1920 (see 
Sieg, 1999 and Zach, 1999), included the establishment of metatheoretical 
results Uke completeness, decidability, and consistency for propositional logic 
in 1917/18, and was followed by ever more nuanced axiom systems for propositional 
and predicate logic. This first work in purely logical axiomatics was soon 
extended to include mathematics. Here Hilbert followed his own proposal, 
made first in 1905,^ to develop mathematics and logic simultaneously. The 
extent of this simultaneous development is nowhere clearer than in Hilbert's 
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lecture course of 1921/22, where the 8-operator is first used as both a logical 
notion, representing the quantifiers, and an arithmetical notion, representing 
induction in the form of the least number principle. Hilbert realized then that 
a consistency proof for all of mathematics is a difficult undertaking, best 
attempted in stages: 

Considering the great variety of coimectives and interdependencies exhibited 
by arithmetic, it is obvious from the start that we will the not be able to 
solve the problem of proving consistency in one fell swoop. We will 
instead first consider the simplest connectives, and then proceed to ever 
higher operations and inference methods, whereby consistency has to 
be established for each extension of the system of signs and inference 
rules, so that these extensions do not endanger the consistency [result] 
established in the preceding stage. 

Another important aspect is that, following our plan for the complete 
formalization of arithmetic, we have to develop the proper mathematical 
formalism in connection with the formaUsm of the logical operations, so 
that — as I have expressed it — a simultaneous construction of mathematics 
and logic is executed.^ 
Hilbert had rather clear ideas, once the basic tools both of proof and of 
formaUzation were in place, of what the stages should be. In an addendum 
to the lecture course on Grundlagen der Mathematik, taught by Hilbert and 
Bemays in 1922-23,^ he outlined them. The first stage had already been 
accomplished: Hilbert gave consistency proofs for calculi of propositional 
logic in his 1917/18 lectures. Stage n consist in the elementary calculus of 
free variables, plus equality axioms and axioms for successor and predecessor. 
The axioms are: 



I. A^B^A 2. 
3. {A^B^C)^{B^A^C) 4. 
5. A&B^A 6. 
7. A~^B-^A&B 8. 
9. B^AyB 10. 

II. A^A^B 12. 
13. a = a 14. 
15. a + l^O 16. 



{A^A-^B)^A-^B 
{B^C)^{A^B)^A^C 
A&B^B 

(A^C) ^ {B^C) ^AVB^C 
{A^B)^{A^B)^B 
a = b^A{a) ^ A{b) 
h{a+\)=a^ 



In Hubert's systems, Latin letters are variables; in particular, a, b, c, . . . , 
are individual variables and A, B, C, . . . , are formula variables. The rules 
of inference are modus ponens and substitution for individual and formula 
variables. 

Hilbert envisaged his foundational project as a stepwise "simultaneous 
development of logic and mathematics," in which axiomatic systems for logic, 
arithmetic, analysis, and finally set theory would be developed. Each stage 
would require a proof of consistency before the next stage is developed. In a 
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handwritten supplement to the typescript of the 1922-23 lecture notes on the 
foundations of arithmetic, Hilbert presents a rough overview of what these 
steps might be: 

Outline. Stage II was elementary calculation, axioms I-I6. 

Stage III. Now elementary number theory 
Schema for definition of functions by recursion and modus ponens 
will add the schema of induction to modus ponens 
even if this coincides in substance with the results of intuitively obtained 
number theory, we are now deahng with formulas, e.g, a + b = b + a. 

Stage rm. Transfinite inferences and parts of analysis 

Stage V. Higher-order variables and set theory. Axiom of choice. 

Stage VI. Numbers of the 2nd number class, full transfinite induction. 
Higher types. Continuum problem, transfinite induction for numbers in 
the 2nd number class. 

Stage VII. (1) Replacement of infinitely many definitional schemata 
by one axiom. (2) Analysis and set theory. At level 4, again the full 
theorem of the least upper bound. 

Stage Vm. FormaUzation of well ordering.^ 

2.1. The propositional calculus and the calculus of 
elementary computation 

Step I had been achieved in 1917-18. Already in the lectures from the Winter 
term 1917/18, Hilbert and Bemays had proved that the propositional calculus 
is consistent. This was done first by providing an arithmetical interpretation, 
where they stressed that only finitely many numbers had to be used as "values" 
(0 and I). The proof is essentially a modem proof of the soundness of propositional 
logic: A truth value semantics is introduced by associating which each formula 
of the propositional calculus a truth function mapping tuples of and 1 
(the values of the propositional variables) to or 1 (the truth value of the 
formula under the corresponing valuation). A formula is called correct if it 
corresponds to a truth function which always takes the value 1. It is then 
showed that the axioms are correct, and that modus ponens preserves correctness. 
So every formula derivable in the propositional calculus is correct. Since 
A and A caimot both be correct, they caimot both be derivable, and so the 
propositional calculus is consistent. 

It was very important for Hilbert that the model for the propositional 
calculus thus provided by {0,1} was finite. As such, its existence, and the 
admissibility of the consistency proof was beyond question. This lead him to 
consider the consistency proof for the propositional calculus to be the prime 
example for for a consistency proof by exhibition in his 1921/22 lectures 
on the foundations of mathematics. The consistency problem in the form 
of a demand for a consistency proof for an axiomatic system which neither 
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proceeds by exhibiting a model, nor by reducing consistency of a system to 
the consistency of another, but by providing a metamathematical proof that 
no derivation of a contradiction is possible, is first formulated in lectures in 
the Summer term of 1920. Here we find a first formulation of an arithmetical 
system and a proof of consistency. The system consists of the axioms 

1 = 1 

{a = b) (a + 1 =^7+1) 
(a+1 = ^+1) {a = b) 

{a = b) {{a = c)^{b = c)). 

The notes contain a proof that these four axioms, together with modus ponens, 
do not allow the derivation of the formula 



a+1 = 1. 

The proof itself is not too interesting, and 1 will not reproduce it here.''^ The 
system considered is quite weak. It does not even contain all of propositional 
logic: negation only appears as inequality, and only formulas with at most two 
signs are derivable. Not even a = a is derivable. It is here, nevertheless, 
that we find the first statement of the most important ingredient of Hilbert's 
project, namely, proof theory: 

Thus we are led to make the proofs themselves the object of our investigation; 
we are urged towards a proof theory, which operates with the proofs 
themselves as objects. 

For the way of thinking of ordinary number theory the numbers are 
then objectively exhibitable, and the proofs about the numbers already 
belong to the area of thought. In our study, the proof itself is something 
which can be exhibited, and by thinking about the proof we arrive at the 
solution of our problem. 

Just as the physicist examines his apparatus, the astronomer his position, 
just as the philosopher engages in critique of reason, so the mathematician 
needs his proof theory, in order to secure each mathematical theorem by 
proof critique. 

This project is developed in earnest in two more lecture courses in 1921- 
22 and 1922-23. These lectures are important in two respects. First, it is here 
that the axiomatic systems whose consistency is to be proven are developed. 
This is of particular interest for an understanding of the relationship of Hilbert 
to Russell's project in the Principia and the influence of Russell's work both 
on Hilbert's philosophy and on the development of axiomatic systems for 
mathematics.^^ Sieg (1999) has argued that, in fact, Hilbert was a logicist 
for a brief period around the time of his paper "Axiomatic Thought" (Hilbert, 
1918a). However, as noted in Section 1, Hilbert soon became critical of Russell's 
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type theory, in particular of the axiom of reducibility. Instead of taking the 
system of Principia as the adequate formalization of mathematics the consistency 
of which was to be shown, Hilbert proposed a new system. The guiding 
principle of this system was the "simultaneous development of logic and 
mathematics" — as opposed to a development of mathematics out of logic — 
which he had already proposed in Hilbert (1905, 176). The cornerstone of this 
development is the 8-calculus. The second major contribution of the 1921- 
22 and 1922-23 lectures are the consistency proofs themselves, including 
the Hilbertsche Ansatz for the e-substitution method, which were the direct 
precursors to Ackermann's dissertation of 1924. 

In contrast to the first systems of 1920, here Hilbert uses a system based on 
full propositional logic with axioms for equality, i.e., the elementary calculus 
of free variables: 

I. Logical axioms 

a) Axioms of consequence 
\)A^B-^A 

2) {A^A^B)^A^B 

3) Ia-^B-^C)-^B^A^C 

4) {B^C)^{A^B)^A^C 

b) Axioms of negation 

5) A^A-^B _ 

6) {A-^B)^{A^B)^B 
n. Arithmetical axioms 

a) Axioms of equality 

7) a = a 

8) a = b^Aa^Ab 

b) Axioms of number 

9) a + l 7^0 

10) 5(a + l) =fli3 

Here, ' + 1 ' is a unary function symbol. The rules of inference are substitution 
(for individual and formula variables) and modus ponens. 

Hubert's idea for how a finitistic consistency proof should be carried out 
is first presented here. The idea is this: suppose a proof of a contradiction is 
available. We may assume that the end formula of this proof is 7^ 0. 

1 . Resolution into proof threads. First, we observe that by duplicating part 
of the proof and leaving out steps, we can transform the derivation to one 
where each formula (except the end formula) is used exactly once as the 
premise of an inference. Hence, the proof is in tree form. 

2. Elimination of variables. We transform the proof so that it contains no 
free variables. This is accomplished by proceeding backwards from the 
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end formula: The end formula contains no free variables. If a formula 
is the conclusion of a substitution rule, the inference is removed. If a 
formula is the conclusion of modus ponens it is of the form 

where 05' results from *B by substituting terms for free variables. If these 
variables also occur in 21, we substitute the same terms for them. Variables 
in 21 which do not occur in 03 are replaced with 0. This yields a formula 
21' not containing variables.^"* The inference is replaced by 

21' 21' ^ 03' 
03' 

3. Reduction of junctionals. The remaining derivation contains a number of 
terms (junctionals in Hilbert's parlance) which now have to be reduced 
to numerical terms (i.e., standard numerals of the form (...(0+ 1) + 
• • •) + 1). In this case, this is done easily by rewriting innermost subterms 
of the form 5(0) by and 8(n+ I) by n. In later stages, the set of 
terms is extended by function symbols introduced by recursion, and the 
reduction of functionals there proceeds by calculating the function for 
given numerical arguments according to the recursive definition. This will 
be discussed in the next section. 

In order to establish the consistency of the axiom system, Hilbert suggests, 
we have to find a decidable {konkret jeststellbar) property of formulas so that 
every formula in a derivation which has been transformed using the above 
steps has the property, and the formula 0^0 lacks it. The property Hilbert 
proposes to use is correctness. This is not to be understood as truth in a model. 
The formulas still occurring in the derivation after the transformation are all 
Boolean combinations of equations between numerals. An equation between 
numerals n = m is correct if n and m are syntactically equal, and the negation 
of an equality is correct of m and n are not syntactically equal. 

If we call a formula which does not contain variables or functionals other 
than numerals an ''explicit [numerical] formula", then we can express 
the result obtained thus: Every provable explicit [numerical] formula is 
end formula of a proof all the formulas of which are explicit formulas. 

This would have to hold in particular of the formula 7^ 0, if it 
were provable. The required proof of consistency is thus completed if 
we show that there can be no proof of the formula which consists of 
only explicit formulas. 

To see that this is impossible it suffices to find a concretely determinable 
[konkret feststellbar] property, which first of all holds of all explicit 
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formulas which result from an axiom by substitution, which furthermore 
transfers from premises to end formula in an inference, which however 
does not apply to the formula 7^ 0.^^ 

Hilbert now defines the notion of a (conjunctive) normal form and gives a 
procedure to transform a formula into such a normal form. He then provides 
the wanted property: 

With the help of the notion of a normal form we are now in a position 
to exhipit a property which distinguishes the formula 0^0 from the 
provable expUcit formulas. 

We divide the explicit formulas into "correct" and "incorrect." The 
expUcit atomic formulas are equations with numerals on either side [of 
the equality symbol]. We call such an equation correct, if the numerals 
on either side coincide, otherwise we call it incorrect. We call an inequality 
with numerals on either side correct if the two numerals are different, 
otherwise we call it incorrect. 

In the normal form of an arbitrary explicit formula, each disjunct has 
the form of an equation or an inequality with numerals on either side. 

We now call a general explicit formula correct if in the corresponding 
normal form each disjunction which occurs as a conjunct (or which 

constitutes the normal form) contains a correct equation or a correct 
inequality. Otherwise we call the formula incorrect. [. ..] 

According to this definition, the question of whether an explicit formula 
is correct or incorrect is concretely decidable in every case. Thus the 
"tertium non datur" holds here. . . 

This use in the 1921-22 lectures of the conjunctive normal form of a propositional 
formula to define correctness of Boolean combinations of equalities between 
numerals goes back to the 1917-18 lecture notes,'^ where transformation 
into conjunctive normal form and testing whether each conjunct contains 
both A and A was proposed as a test for propositional validity. Similarly, 
here a formula is correct if each conjunct in its conjunctive normal form 
contains a correct equation or a correct inequality. In the 1922-23 lectures, 
the definition involving conjunctive normal forms is replaced by the usual 
inductive definition of propositional truth and falsehood by truth tables (Hilbert 
and Bemays 1923a, 21). Armed with the definition of correct formula, Hilbert 
can prove that the derivation resulting from a proof by transforming it according 
to (l)-(3) above contains only correct formulas. Since 7^ is plainly not 
correct, there can be no proof of 7^ in the system consisting of axioms 
(I)-(IO). The proof is a standard induction on the length of the derivation: the 
formulas resulting from the axioms by eUmination of variables and reduction 
of functionals are all correct, and modus ponens preserves correctness.^^ 



10 



2.2. Elementary number theory with recursion and 

INDUCTION rule 

The system of stage III consists of the basic system of the elementary calculus 
of free variables and the successor function, extended by the schema of defining 
functions by primitive recursion and the induction rule.^*^ A primitive recursive 
definition is a pair of axioms of the form 

(p{0,bi,...,bn) = a{bi,...,bn) 
((>{a+l,bu...,b„) = b{a,(^{a),bu...,b„) 

where a{b\ , . . . ,bn) contains only the variables bi, . .. ,bn, and b{a,c,bi ,bn) 
contains only the variables a, c, bi, bn- Neither contains the function 
symbol (p or any function symbols which have not yet been defined. 

The introduction of primitive recursive definitions and the induction rule 
serves, first of all, the purpose of expressivity. Surely any decent axiom system 
for arithmetic must provide the means of expressing basic number-theoretic 
states of affairs, and this includes addition, subtraction, multiplication, division, 
greatest common divisor, etc. The general schema of primitive recursion is 
already mentioned in the Kneser notes for 1921-22 (Hilbert, 1922a, Heft 
n, 29), and is discussed in some detail in the notes for the lectures of the 
following year (Hilbert and Bernays, 1923a, 26-30). 

It may be interesting to note that in the 1922-23 lectures, there are no 
axioms for addition or multiplication given before the general schema for 
recursive definition. This suggests a change in emphasis during 1922, when 
Hilbert realized the importance of primitive recursion as an arithmetical concept 
formation. He later continued to develop the notion, hoping to capture all 
number theoretic functions using an extended notion of primitive recursion 
and to solve the continuum problem with it. This can be seen from the attempt 
at a proof of the continuum hypothesis in (1926), and Ackermann's paper on 
"Hubert's construction of the reals" (1928b), which deals with hierarchies 
of recursive functions. The general outlook in this regard is also markedly 
different from Skolem's 1923, which is usually credited with the definition of 
primitive recursive arithmetic.'^^ 

Hilbert would be remiss if he would not be including induction in his 
arithmetical axiom systems. As he already indicates in the 1921-22 lectures, 
however, the induction principle cannot be formulated as an axiom without 
the help of quantifiers. 

We are still completely missing the axiom of complete induction. One 
might think it would be 



{Z{a) (A(a) ^A{a + m ^ {A(I) ^ {Z{b) ^Aib))} 
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That is not it, for take a = 1. The hypothesis must hold for all a. We have, 
however, no means to bring the all into the hypothesis. Our formalism 
does not yet suffice to write down the axiom of induction. 

But as a schema we can: We extend our methods of proof by the 
following schema. 

.^(1) K{a)-^K{a + l) 
Z(a) ^ R{a) 

Now it makes sense to ask whether this schema can lead to a contradiction.^^ 

The induction schema is thus necessary in the formulation of the elementary 
calculus only because quantifiers are not yet available. Subsequently, induction 
will be subsumed in the 8-calculus. 

The consistency proof for stage II is extended to cover also the induction 
schema and primitive recursive definitions. Both are only sketched: Step (3), 
reduction of functionals, is extended to cover terms containing primitive recursive 
functions by recursively computing the value of the innermost term containing 
only numerals. Both in the 1921-22 and the 1922-23 sets of notes by Kneser, 
roughly a paragraph is devoted to these cases (the official sets of notes for 
both lectures do not contain the respective passages). 

How de we proceed for recursions? Suppose a 9(3) occurs. Either [3 is] 
0, then we replace it by a. Or [it is of the form] 9(3 + 1): [replace it with] 
b(3,cp(3)). Claim: These substitutions eventually come to an end, if we 
replace innermost occurrences first.^^ 

The claim is not proved, and there is no argument that the process terminates 
even for terms containing several different, nested primitive recursively defined 
function symbols. 

For the induction schema, Hilbert hints at how the consistency proof must 
be extended. Combining elimination of variables and reduction of functionals 
we are to proceed upwards in the proof as before until we arrive at an instance 
of the induction schema: 

.^(1) ^{a)^^{a+l) 
Z(3)^^'(3) 

By copying the proof ending in the right premise, substituting numerals 1, 
t) (where 3 = r) + 1) for a and applying the appropriate substitutions to 
the other variables in ^ we obtain a proof of Z{a) ^'{i) without the last 
application of the induction schema. 

With the introduction of the 8-calculus, the induction rule is of only minor 
importance, and its consistency is never proved in detail until Hilbert and 
Bemays (1934, 298-99). 
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2.3. The 8-calculus and the axiomatization of mathematics 

In the spirit of the "simultaneous development of logic and mathematics," 
Hilbert takes the next step in the axiomatization of arithmetic by employing 
a principle taken from Zermelo's axiomatization of set theory: the axiom of 
choice. Hilbert and Bernays had dealt in detail with quantifiers in lectures in 
1917-18 and 1920, but they do not directly play a significant role in the axiom 
systems Hilbert develops for mathematics. Rather, the first- and higher-order 
calculi for which consistency proofs are proposed, are based instead on choice 
functions. The first presentation of these ideas can be found in the 1921-22 
lecture notes by Kneser (the official notes do not contain these passages). The 
motivation is that in order to deal with analysis, one has to allow definitions 
of functions which are not finitary. These concept formations, necessary for 
the development of mathematics free from intuitionist restrictions, include 
definition of functions from undecidable properties, by unbounded search, 
and choice. 

Not finitely (recursively) defined is, e.g., (p(a) = if there is a so that 
+ ab^ + 7 is prime, and = 1 otherwise. But only with these numbers 
and functions the real mathematical interest begins, since the solvability 
in finitely many steps is not foreseeable. We have the conviction, that 
such questions, e.g., the value of (p(a), are solvable, i.e., that (p(a) is also 
finitely definable. We cannot wait on this, however, we must allow such 
definitions for otherwise we would restrict the free practice of science. 
We also need the concept of a function of functions.^ 

The concepts which Hilbert apparently takes to be fundamental for this project 
are the principle of the excluded middle and the axiom of choice, in the form 
of second-order functions x and a. The axioms for these functions are 

1. T(/)=0^(Z(a)^/(a) = l) 

2. T(/)^O^Z(a(/)) 

3. T(/)^0^/(a(/))/l 

4. T(/)^0^T(/) = 1 

The intended interpretation is: t(/) = if / is always 1 and = 1 if one can 
choose an a(/) so that /(a(/)) / 1. 

The introduction of 1 and a allows Hilbert to replace universal and existential 
quantifiers, and also provides the basis for proofs of the axiom of induction 
and the least upper bound principle. Furthermore, Hilbert claims, the consistency 
of the resulting system can be seen in the same way used to establish the 
consistency of stage HI (primitive recursive arithmetic). From a proof of 
a numerical formula using x's and a's, these terms can be eliminated by 
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finding numerical substitutions which turn the resulting formulas into correct 
numerical formulas. 

These proofs are sketched in the last part of the 1921-22 lecture notes by 
Kneser (Hilbert, 1922a).^^ In particular, the consistency proof contains the 
entire idea of the Hilbertsche Ansatz, the 8-substitution method: 

First we show that we can eliminate all variables, since here also only 
free variables occur. We look for the innermost x and a. Below these 
there are only finitely defined [primitive recursive] functions (p, (p'. Some 
of these functions can be substituted for / in the axioms in the course 
of the proof. 1: T((p) = ^ (■^('i) = 1)' where a is a functional. 

If this is not used, we set all a((p) and T(cp) equal to zero. Otherwise 
we reduce a and (p(a) and check whether Z(a) — > (p(a) = 1 is correct 
everywhere it occurs. If it is correct, we set T[(p] = 0, a[(p] = 0. If it is 
incorrect, i.e., if o = 3, 9(3) 7^ 1, we let T((p) = 1, a((p) = 3. After this 
replacement, the proof remains a proof. The formulas which take the 
place of the axioms are correct. 

(The idea is: if a proof is given, we can extract an argument from it 
for which (p = 1.) In this way we eliminate the T and a and applications 
of [axioms] (l)-(4) and obtain a proof of 1 7^ 1 from I-V and correct 
formulas, i.e., from I-V, 



x{f,b)=0 ^ {Z{a)^f{a,b) = l} 

%{f,b)j^O ^ Z{f{a,b)) 

T{f,b)^0 ^ f{a{f,b),b)^l 

%{f,b)^0 ^ %{f,b) = l^^ 



Although not formulated as precisely as subsequent presentations, all the 
ingredients of Hilbert's 8-substitution method are here. The only changes that 
are made en route to the final presentation of Hilbert's sketch of the case for 
one 8 and Ackermann's are mostly notational. In (Hilbert, 1923), a talk given 
in September 1922, the two functions 1 and a are merged to one function 
(also denoted x), which in addition provides the least witness for x(f{a)) ^ 1. 
There the x function is also applied directly to formulas. In fact, XaA(fl) is the 
primary notion, denoting the least witness a for which A{a) is false; x(/) is 
defined as Xa{.f{p) = 0). Interestingly enough, the sketch given there for the 
substitution method is for the x-function for functions, not formulas, just as it 
was in the 1921-22 lectures. 

The most elaborate discussion of the 8-calculus can be found in Hilbert's 
and Bernays's course of 1922-23. Here, again, the motivation for the 8-function 
is Zermelo's axiom of choice: 

What are we missing? 
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1. As far as logic is concerned: we have had the propositional calculus 
extended by free variables, i.e., variables for which arbitrary functionals 
may be substituted. Operating with "all" and "there is" is still missing. 

2. We have added the induction schema, but without consistency proof 
and also on a provisional basis, with the intention of removing it. 

3. So far only the arithmetical axioms which refer to whole numbers. 
The above shortcomings prevent us from building up analysis (limit 
concept, irrational number). 

These 3 points already give us a plan and goals for the following. 

We turn to (1). It is clear that a logic without "all' — "there is" would 
be incomplete, 1 only recall how the application of these concepts and of 
the so-called transfinite inferences has brought about major problems. 
We have not yet addressed the question of the applicability of these 
concepts to infinite totalities. Now we could proceed as we did with the 
propositional calculus: Formulate a few simple [principles] as axioms, 
from which all others follow. Then the consistency proof would have to 
be carried out — according to our general program: with the attitude that 
a proof is a figure given to us. Significant obstacles to the consistency 
proof because of the bound variables. The deeper investigation, however, 
shows that the real core of the problem lies at a different point, to which 
one usually only pays attention later, and which also has only been 
noticed in the literature of late.^^ 
At the corresponding place in the Kneser Mitschrift, Hilbert continues: 

[This core lies] in Zermelo's axiom of choice. [.. .] The objections [of 
Brouwer and Weyl] are directed against the choice principle. But they 
should likewise be directed against "all" and "there is", which are based 
on the same basic idea. 

We want to extend the axiom of choice. To each proposition with a 
variable A (a) we assign an object for which the proposition holds only 
if is holds in general. So, a counterexample, if one exists. 

8(A), an individual logical function. [. . . ] 8 satisfies the transfinite 
axiom: 

(16) A(8A) ^Aa 

e.g., Aa means: a is corrupt. sA is Aristides.^^ 

Hilbert goes on to show how quantifiers can be replaced by e-terms. The 
corresponding definitional axioms are already included in Hilbert (1923), i.e., 
A(8A) = (a)A(a) and A(8A) = (3a)A(a). Next, Hilbert outlines a derivation 
of the induction axioms using the 8-axioms. For this, it is necessary to require 
that the choice function takes the minimal value, which is expressed by the 
additional axiom 

eA7^0^A(5(8A)). 
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With this addition, Hilbert combined the K function of (Hilbert, 1922c) and 
the n function of (1923) with the 8 function. Both K ("k" for Kleinstes, least) 
and ^ had been introduced there as functions of functions giving the least 
value for which the function differs from 0. In Hilbert (1923, 161-162), 
Hilbert indicates that the axiom of induction can be derived using the function, 
and credits this to Dedekind (1888). 

The third issue Hilbert addresses is that of dealing with real numbers, and 
extending the calculus to analysis. A first step can be carried out at stage IV by 
considering a real number as a function defining an infinite binary expansion. 
A sequence of reals can then be given by a function with two arguments. 
Already in Hilbert (1923) we find a sketch of the proof of the least upper 
bound principle for such a sequence of reals, using the % function: 



The general case of sets of reals needs function variables and second-order 8 
and 71. These are briefly introduced as 8/A with the axioms 



The last two lectures transcribed in (Hilbert and Bemays, 1923a) are devoted 
to a sketch of the 8 substitution method. The proof is adapted from Hilbert 
(1923), replacing tf with zA, also deals with %, and covers the induction 
axiom in its form for the 8-calculus.^^ During the last lecture, Bemays also 
extends the proof to second-order 8's. 

If we have a function variable: 



[. . . ] Suppose e only occurs with 21 (e.g., /O = 0, //O = 0). How will we 
eliminate the function variables? We simply replace fc by c. This does 
not apply to bound variables. For those we take some fixed function, e.g., 
5 and carry out the reduction with it. Then we are left with, e.g., 215 — > 
2lcp. This, when reduced, is either correct or incorrect. In the latter case, 
2l(p is incorrect. Then we substitute cp everywhere for 8/21/. Then we 
end up with 2l(p 2t\|/. That is certainly correct, since 2l(p is incorrect. 
The last development regarding the £-calculus before Ackermann's dissertation 
is the switch to the dual notation. Both (Hilbert, 1923) and (Hilbert and 
Bemays, 1923a) use 8A as denoting a counterexample for A, whereas at least 
from Ackermann's dissertation onwards, 8A denotes a witness. Correspondingly, 
Ackermann uses the dual axiom A{a) A(8aA(a)). Although it is relatively 




A8/A ^ Af 
(j)Af ^ ji/A/ = 
(7)A/ ^ 7I/A/=1 



Ae/A/^A/ 



16 



clear that the supplement to (Hilbert and Bernays, 1923a) — 24 sheets in Hilbert's 
hand — are Hilbert's notes based on which he and partly Bernays presented 
the 1922-23 lectures, parts of it seem to have been altered or written after the 
conclusion of the course. Sheets 12-14 contain a sketch of the proof of the 
axiom of induction from the standard, dual e axioms; the same proof for the 
original axioms can be found on sheets 8-11. 

This concludes the development of mathematical systems using the s- 
calculus and consistency proofs for them presented by Hilbert himself. We 
now turn to the more advanced and detailed treatment in Wilhelm Ackermann's 
(1924b) dissertation. 



3. Ackermann's Dissertation 

Wilhelm Ackermann was born in 1896 in Westphalia. He studied mathematics, 
physics, and philosophy in Gottingen between 1914 and 1924, serving in the 
army in World War I from 1915-1919. He completed his studies in 1924 with 
a dissertation, written under Hilbert, entitled "Begriindung des 'tertium non 
datur' mittels der Hilbertschen Theorie der Widerspruchsfreiheit" (Acker- 
mann, 1924a; 1924b), the first major contribution to proof theory and Hilbert's 
Program. In 1927 he decided for a career as a high school teacher rather than 
a career in academia, but remained scientifically active. His major contri- 
butions to logic include the function which carries his name — an example 
of a recursive but not primitive recursive function (Ackermann, 1928b), the 
consistency proof for arithmetic using the 8-substitution method (Ackermann, 
1940), and his work on the decision problem (Ackermann, 1928a; 1954). He 
served as co-author, with Hilbert, of the influential logic textbook Grundzuge 
der theoretischen Logik (Hilbert and Ackermann, 1928). He died in 1962.^^ 

Ackermann's 1924 dissertation is of particular interest since it is the first 
non-trivial example of what Hilbert considered to be a finitistic consistency 
proof. Von Neumann's paper of 1927 does not entirely fit into the tradition of 
the Hilbert school, and we have no evidence of the extent of Hilbert's involve- 
ment in its writing. Later consistency proofs, in particular those by Gentzen 
and Kalmar, were written after Godel's incompleteness results were akeady 
well-known and their implications understood by proof theorists. Ackermann's 
work, on the other hand, arose entirely out of Hilbert's research project, and 
there is ample evidence that Hilbert was aware of the range and details of 
the proof. Hilbert was Ackermann's dissertation advisor, approved the thesis, 
was editor of Mathematische Annalen, where the thesis was published, and 
corresponded with Ackermann on corrections and extensions of the result. 
Ackermann was also in close contact with Paul Bernays, Hilbert's assistant 
and close collaborator in foundational matters. Ackermann spent the first half 
of 1925 in Cambridge, supported by a fellowship from the International Ed- 
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ucation Board (founded by John D. Rockefeller, Jr., in 1923). In his letter of 
recommendation for Ackermann, Hilbert writes: 

In his thesis "Foundation of the 'tertium non datur' using Hilbert's the- 
ory of consistency," Ackermann has shown in the most general case that 
the use of the words "all" and "there is," of the "tertium non datur," is 
free from contradiction. The proof uses exclusively primitive and finite 
inference methods. Everything is demonstratded, as it were, directly on 
the mathematical formahsm. 

Ackermann has here surmounted considerable mathematical diffi- 
culties and solved a problem which is of first importance to the modem 
efforts directed at providing a new foundation for mathematics.-'^ 
Further testimony of Hilbert's high esteem for Ackermann can be found in 
the draft of a letter to Russell asking for a letter of support to the International 
Education Board, where he writes that "Ackermann has taken my classes on 
foundations of mathematics in recent semesters and is currently one of the 
best masters of the theory which 1 have developed here."^^ 

Ackermann's work provides insight into two important issues relating to 
Hilbert's program as it concerns finitistic consistency proofs. First, it provides 
historical insight into the aims and development of Hilbert's Program: The 
first part of the program called for an axiomatization of mathematics. These 
axiomatizations were then the objects of metamathematical investigations: 
the aim was to find finitistic consistency proofs for them. Which areas of 
mathematics were supposed to be covered by the consistency proofs, how 
were they axiomatized, what is the strength of the systems so axiomatized? 
We have already seen what Hilbert's roadmap for the project of axiomatiza- 
tion was. Ackermann's dissertation provides the earliest example of a formal 
system stronger than elementary arithmetic. The second aim, the metamath- 
ematical investigation of the formal systems obtained, also poses historical 
questions: When did Ackermann, and other collaborators of Hilbert (in par- 
ticular, Bemays and von Neumann) achieve the results they sought? Was 
Ackermann's proof correct, and if not, what parts of it can be made to work? 

The other information we can extract from an analysis of Ackermann's 
work is what methods were used or presupposed in the consistency proofs 
that were given, and thus, what methods were sanctioned by Hilbert himself 
as falling under the finitist standpoint. Such an analysis of the methods used 
are of a deeper, conceptual interest. There is a fundamental division between 
Hilbert's philosophical remarks on finitism on the one hand, and the professed 
goals of the program on the other. In these comments, rather little is said 
about the concept formations and proof methods that a finitist, according to 
Hilbert, is permitted to use. In fact, most of Hilbert's remarks deal with the 
objects of finitism, and leave the finitistically admissible forms of definition 
and proof to the side. These, however, are the questions at issue in contempo- 
rary conceptual analyses of finitism. Hilbert's relative silence on the matter 
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is responsible for the widespread — and largely correct — opinion that Hilbert 
was too vague on the question of what constitutes finitism to unequivocally 
define the notion, and therefore later commentators have had enough leeway 
to disagree widely on the strength of the finitist standpoint while still claiming 
to have explicated Hilbert' s own concept. 

3.1. Second-order primitive recursive arithmetic 

In Ackermann (1924b), the system of stage III is extended by second-order 
variables for functions. The schema of recursive definition is then extended 
to include terms containing such variables. In the following outline, I shall 
follow Ackermann and adopt the notation of subscripting function symbols 
and terms by variables to indicate that these variables do not occur freely but 
rather as placeholders for functions. For instance, Oa(/(a)) indicates that the 
term a does not contain the variable a free, but rather that the function /(a) 
appears as a functional argument, i.e., that the term is of the form a(ka.f{a)). 
The schema of second-order primitive recursion is the following: 



To clarify the subscript notation, compare this with the schema of second- 
order primitive recursion using ^.-abstraction notation: 



Using this schema, it is possible to define the Ackermann function. This was 
already pointed out in Hilbert (1926), although it was not until Ackermann 
(1928b) that it was shown that the function so defined cannot be defined by 
primitive recursion without function variables. Ackermann (1928b) defines 
the function as follows. First it is observed that the iteration function 



(Pg_.(0,/(BO,?) = ag^(/(BO,?) 



(f>{a + \,Xbi.f(bi),c) = b{aMaMi-f{di),c),Xbi.f{bi)) 




can be defined by second-order primitive recursion: 



Pc{OJic),b) = b 
Pcia + l,f{c),b) = f{pcia,f{c),b)) 



Furthermore, we have two auxiliary functions 




and 
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which are primitive recursive, as well as addition and multiplication. The term 
a{a, b) is short for i(a, 1) -1(0, 0) ■ ^7 + 1); we then have 

rO ifa = 
a{a,b) = l\ ifa = 1 
I b otherwise 

The Ackermann function is defined by 

cp(0,Z7,c) = b + c 
(p(a+l,&,c) = Prf(c,(p(a,Z?,i/),a(a,Z?)). 

In more suggestive terms, 

9(0, ^7,c) = b + c 
9(1, ^7,c) = be 
(p(a+l,^7,c) = (p(a,^7,(p(a,fc,...(p(a,&,fe)---)) 

c times 

The system of second-order primitive recursive arithmetic 2PRA^ used in 
Ackermann (1924b) consists of axioms (1)-(15) of Hilbert and Bemays (1923b, 
see Section 2), axiom (16) was replaced by 

16. a^Q ^ a = 8(a) + 1, 

plus defining equations for both first- and second-order primitive recursive 
functions. There is no induction rule (which is usually included in systems 
of primitive recursive arithmetic), although the consistency proof given by 
Ackermann can easily be extended to cover it. 

3.2. The consistency proof for primitive recursive 
arithmetic 

Allowing primitive recursion axioms for functions which contain function 
variables is a natural extension of the basic calculus of stages III and IIII, and 
is necessary in order to be able to introduce sufficiently complex functions. 
Hilbert seems to have thought that by extending primitive recursion in this 
way, or at least by building an infinite hierarchy of levels of primitive recur- 
sions using variables of higher types, he could account for all the number 
theoretic functions, and hence for all real numbers (represented as decimal 
expansions). In the spirit of the stage-by-stage development of systems of 
mathematics and consistency proofs, it is of course necessary to show the 
consistency of the system of stage IIII, which is the system presented by 
Ackermann. As before, it makes perfect sense to first establish the consistency 
for the fragment of stage mi not containing the transfinite 8 and 7t functions. 
In Section 4 of his dissertation, Ackermann undertakes precisely this aim. 
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The proof is a direct extension of the consistency proof of stage III, the el- 
ementary calculus of free variables with basic arithmetical axioms and prim- 
itive recursive definitions, i.e., PRA. This proof had already been presented 
in Hubert's lectures in 1921-22 and 1922-23. The idea here is the same: 
put a given, purported proof of / into tree form, eliminate variables, and 
reduce functionals. The remaining figure consists entirely of correct formulas, 
where correctness of a formula is a syntactically defined and easily decidable 
property. The only complication for the case where function variables are 
also admitted is the reduction of functionals. It must be shown that every 
functional, i.e., every term of the language, can be reduced to a numeral on 
the basis of the defining recursion equations. For the original case, this could 
be done by a relatively simple inductive proof. For the case of 2PRA^, it is 
not so obvious. 

Ackermann locates the difficulty in the following complication. Suppose 
you have a functional (f>t,{2,b{b)). Here, b{b) is a term which denotes a 
function, and so there is no way to replace the variable b with a numeral 
before evaluating the entire term. In effect, the variable b is bound (in modern 
notation, the term might be more suggestively written (p{2,'kb.b{b)).) In order 
to reduce this term, we apply the recursion equations for cp and might end up 
with a term like 

b(l) + b(2) + b(0)-b(l) + b(I)-t)(2). 

The remaining b's might in turn contain (p. By contrast, reducing a term \\f{^) 
where \|/ is defined by first-order primitive recursion results in a term which 
does not contain \|/. 

To show that the reduction indeed comes to an end if iimermost subterms 
are reduced first, Ackermann proposes to assign indices to terms and show 
that each reduction reduces this index. The indices are, essentially, ordinal 
notations < CO® . Since this is probably the first proof using ordinal notations, 
it may be of some interest to repeat and analyze it in some detail here. In 
my presentation, I stay close to Ackermann's argument and only change the 
notation for ranks, indices, and orders: Where Ackermann uses sequences of 
natural numbers, I will use the more perspicuous ordinal notations. 

Suppose the primitive recursive functions are arranged in a linear order 
according to the order of definition. We write (p < \|/ if (p occurs before \|/ 
in the order of definition, i.e., \\r cannot be used in the defining equations 
for (p. Suppose further that we are given a specific term t. The notion of 
subordination is defined as follows: an occurrence of a function symbol ^ in f 
is subordinate to an occurrence of (p, if (p is the outermost function symbol of 
a subterm s, the occurrence of ^ is in s, and the subterm of s with outermost 
function symbol ^ contains a bound variable b in whose scope the occurrence 
of (p is (this includes the case where b happens to be bound by (p itself).^^ In 
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other words, t is of the form 

t'{...M--M--b. ..)...)...) 

The rank rk{t, (p) of an occurrence of a function symbol (p with respect to t is 
defined as follows: If there is no occurrence of \|/ >(p which is subordinate to 
(p in t, then r^(t, cp) = 1. Otherwise, 

r/:(t,(p) = max{r^(t,\|;) : \|/ >(p is subordinate to 9} + 1. 

The rank r(t, (p) of a term t with respect to a function symbol cp is the maxi- 
mum of the ranks of occurrences of (p or \|/ > (p in (If neither (p nor \|/ > (p 
occur in t, that means r(t, cp) = 0. 

We assign to a subterm s of t a sequence of ranks of . . . , \|/o with 
respect to s, where < ■ ■ ■ < all function symbols occurring in t. 

This is the order of s\ 

o{s) = {r{s,^ifn),...,r{s,^(i)) 

We may think of this order as an ordinal < co", specifically, o{s) corresponds 
to 

(if ■ r(s, \|/„) • + •■• + (0 • r(s, ) • +r(s, \|/o) 

Now consider the set of all distinct subterms of / of a given order o which 
are not numerals. The degree d{{,o) of o in i is the cardinaUty of that set. 
The index j{t) of t is the sequence of degrees ordered in the same way as the 
orders, i.e., it corresponds to an ordinal of the form 

Z'^"-dii,o) 

o 

where the sum ranges over all orders o. Obviously, this is an ordinal < co"". 

Ackermann does not use ordinals to define these indices; he stresses that 
he is only dealing with finite sequences of numbers, on which an elaborate 
order is imposed. Rather than appeal to the well-orderedness of co" , he gives 
a more direct argument that by repeatedly proceeding to indices which are 
smaller in the imposed order one eventually has to reach the index which 
consists of all 0. 

Suppose a term t not containing e or 71 is given. Let s be an innermost 
constant subterm which is not a numeral, we may assume it is of the form 
9&(3i! • • • )3«)Ui, . . . ,Um) where u,- is a term with at least one variable bound 
by (p and which doesn't contain a constant subterm. We have two cases: 

(1) s does not contain bound variables, i.e., m = 0. The order of s is co*^ 
(where cp = \^k)- Evaluating the term s by recursion results in a term s' in 
which only function symbols of lower index occur. Hence, the highest expo- 
nent in the order of s' is less than i, and so o(j') < o{s). Furthermore, since 
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no variable which is bound in t can occur in s' (since no such variable occurs 
in s), replacing s by s' in t does not result in new occurrences of function 
symbols which are subordinate to any other. Thus the number of subterms in 
the term t' which results from such a replacement with orders > o{s) remains 
the same, while the number of subterms of order o(s) is reduced by 1. Hence, 

m < m- 

(2) s does contain bound variables. For simplicity, assume that there is 
one numeral argument and one functional argument, i.e., s is of the form 
9^,(3, c{b)). In this case, all function symbols occurring in c{b) are subordinate 
to (p, or otherwise c{b) would contain a constant subterm.^^ Thus, the rank of 
c{b) in t with respect to is less that the rank of s with respect to \|/,-. 

We reduce the subterm s to a subterm s' by applying the recursion, s' does 
not contain the function symbol (p. We want to show that replacing s by s' in 
t lowers the index of t. 

First, note that when substituting a term a for b in c{b), the order of the 
resulting c(a) with respect to (p is the maximum of the orders of c{b) and 
0, since none of the occurrences of function symbols in a contain bound 
variables whose scope begins outside of o, and so none of these variables 
are subordinate to any function symbols in c{b). 

Now we prove the claim by induction on 3. Suppose the defining equation 
for (p is 

(pbia+lJib)) = bb{(pc{aj{c)),a,f{b)). 

If 3 = 0, then s' = ab{c{b)). At a place where f{b) is an argument to a func- 
tion, f{b) is replaced by c{d), and d is not in the scope of any cp (since a 
doesn't contain cp). For instance, ah{f{b)) = 2 + \|/rf(3,/(<i)). Such a replace- 
ment cannot raise the cp-rank of s' above that of c{b). The term c might also 
be used in places where it is not a functional argument, e.g., if ab{f{b)) = 
/(\|/(/(2))). By a simple induction on the nesting of /'s in ah{f{b)) it can be 
seen that the cp-rank of 5' is the same as that of c(^): For c(()) where t> does 
not contain c, the (p-rank of c(d) equals that of c{b) by the note above and the 
fact that d does not contain (p. If d does contain a nested c, then by induction 
hypothesis and the first case, its cp-rank is the same as that of c{b). By the 
note, again, the entire subterm has the same (p-rank as c{b). 

The case of 9^,(3 -I- 1 , c{b)) is similar. Here, the first replacement is 

bbi'?c{d,cic)),i,cib)). 

Further recursion replaces cpc(3,c(c)) by another term which, by induction 
hypothesis, has cp-rank less than or equal to that of c{b). The same considera- 
tions as in the base case show that the entire term also has a (p-rank no larger 
than c(^7). 



The Practice of Finitism 



23 



We have thus shown that eliminating the function symbol (p by recursion 
from an innermost constant term reduces the (p-rank of the term at least by 
one and does not increase the -ranks of any subterms for any j > i. 

In terms of ordinals, this shows that at least one subterm of order o was 
reduced to a subterm of order o' < o, all newly introduced subterms have 
order < o, and the order of no old subterm increased. Thus, the index of the 
entire term was reduced. The factor of ■ n changed to co" • (« — 1). 

We started with a given constant function, which we characterized by 
a determinate index. We replaced a(^b{i, c(^)) within that functional by 
another functional, where the cp-rank decreased and the rank with respect 
to functions to the right of cp [i.e., which come after cp in the order of 
definition] did not increase. Now we apply the same operation to the 
resulting functional. After finitely many steps we obtain a functional 
which contains no function symbols at all, i.e., it is a numeral. 

We have thus shown: a constant functional, which does not contain 
8 and 71, can be reduced to a numeral in finitely many steps. 

3.3. Ordinals, transfinite recursion, and finitism 

It is quite remarkable that the earliest extensive and detailed technical con- 
tribution to the finitist project would make use of transfinite induction in a 
way not dissimilar to Gentzen's later proof by induction up to Sq. This bears 
on a number of questions regarding Hilbert's understanding of the strength 
of finitism. In particular, it is often said that Gentzen's proof is not fini- 
tist, because it uses transfinite induction. However, Ackermann's original 
consistency proof for 2PRA also uses transfinite induction, using an index 
system which is essentially an ordinal notation system, just like Gentzen's. 
If it is granted that Ackermann's proof is finitistic, but Gentzen's is not, i.e., 
transfinite induction up to co" is finitistic but not up to Sq, then where — 
and why — should the line be drawn? Furthermore, the consistency proof of 
2PRA~ is in essence a — putatively finitistic — explanation of how to compute 
second order primitive recursive functions, and a proof that the computation 
procedure defined by them always terminates. In other words, it is a finitistic 
proof that second order primitive recursive functions are well defined. 

Ackermann was completely aware of the involvement of transfinite induc- 
tion in this case, but he sees in it no violation of the finitist standpoint. 

The disassembling of functionals by reduction does not occur in the 
sense that a finite ordinal is decreased each time an outermost function 
symbol is eliminated. Rather, to each functional corresponds as it were 
a transfinite ordinal number as its rank, and the theorem, that a constant 
functional is reduced to a numeral after carrying out finitely many op- 
erations, corresponds to the other [theorem], that if one descends from 
a transfinite ordinal number to ever smaller ordinal numbers, one has to 
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reach zero after a finite number of steps. Now tliere is naturally no men- 
tion of transfinite sets or ordinal numbers in our metamathematical in- 
vestigations. It is however interesting, that the mentioned theorem about 
transfinite ordinals can be formulated so that there is nothing transfinite 
about it. 

Consider a transfinite ordinal number less than CO ■ CO. Each such or- 
dinal number can be written in the form: CO • n + m, where n and m are 
finite numbers. Hence such an ordinal can also be characterized by a pair 
of finite numbers (n,m), where the order of these numbers is of course 
significant. To the descent in the series of ordinals corresponds the fol- 
lowing operation on the number pair (n,m). Either the first number n 
remains the same, then the number m is replaced by a smaller number 
m'. Or the first number n is made smaller; then I can put an arbitrary 
number in the second position, which can also be larger than m. It is 
clear that one has to reach the number pair (0,0) after finitely many 
steps. For after at most m + 1 steps I reach a number pair, where the first 
number is smaller than n. Let (n',m') be that pair. After at most m' + 1 
steps I reach a number pair in which the first number is again smaller 
than n', etc. After finitely many steps one reaches the number pair (0,0) 
in this fashion, which corresponds to the ordinal number 0. In this form, 
the mentioned theorem contains nothing transfinite whatsoever; only 
considerations which are acceptable in metamathematics are used. The 
same holds true if one does not use pairs but triples, quadruples, etc. 
This idea is not only used in the following proof that the reduction of 
functionals terminates, but will also be used again and again later on, 
especially in the finiteness proof at the end of the work.^^ 
Over ten years later, Ackermann discusses the application of transfinite 
induction for consistency proofs in correspondence with Bernays. Gentzen's 
consistency proof had been published (Gentzen, 1936), and Gentzen asks, 
through Bernays, 

whether you [Ackermann] think that the method of proving termination 
[Endlichkeitsbeweis] by transfinite induction can be applied to the con- 
sistency proof of your dissertation. I would like it very much, if that were 
possible.^^ 

In his reply, Ackermann recalls his own use of transfinite ordinals in the 1924 
dissertation. 

I just realized now, as I am looking at my dissertation, that I operate with 
transfinite ordinals in a similar fashion as Gentzen.^" 

A year and a half later, Ackermann mentions the transfinite induction used in 
his dissertation again: 

I do not know, by the way, whether you are aware (I did at the time not 
consider it as a transgression beyond the narrower finite standpoint), that 
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transfinite inferences are used in my dissertation. (Cf., e.g., the remarks 
in the last paragraph on page 13 and the following paragraph of my 
dissertation)."*^ 

These remarks may be puzzling, since they seem to suggest that Bernays was 
not familiar with Ackermann's work. This is clearly not the case. Bernays cor- 
responded with Ackerman extensively in the mid-20s about the 8-substitution 
method and the decision problem, and had clearly studied Ackermann's dis- 
sertation. Neither Bernays nor Hilbert are on record objecting to the methods 
used in Ackermann's dissertation. It can thus be concluded that Ackermann's 
use of transfinite induction was considered acceptable from the finitist stand- 
point. 

3.4. The e- substitution method 

As we have seen above, Hilbert had outlined an idea for a consistency proof 
for systems involving e-terms already in early 1922 (Hilbert, 1922a), and 
a Uttle more precisely in his talk of 1922 (Hilbert, 1923) and in the 1922- 
23 lectures (Hilbert and Bernays, 1923a). Let us review the Ansatz in the 
notation used in 1924: Suppose a proof involves only one 8 term eaA(a) and 
corresponding critical formulas 

A{li)^A{ZaA{a)), 

i.e., substitution instances of the transfinite axiom 

A{a)^A{ZaA{a)). 

We replace Za^i^) everywhere with 0, and transform the proof as before 
by rewriting it in tree form ("dissolution into proof threads"), eliminating 
free variables and evaluating numerical terms involving primitive recursive 
functions. Then the critical formulas take the form 

A(3,-)^A(0), 

where 3, is the numerical term to which fi,- reduces. A critical formula can 
now only be false if A(3,) is true and A(0) is false. If that is the case, repeat 
the procedure, now substituting 3; for ZaA{a). This yields a proof in which all 
initial formulas are correct and no 8 terms occur. 

If critical formulas of the second kind, i.e., substitution instances of the 

induction axiom, 

ZaA{a)^O^A{hZaA{a)), 

also appear in the proof, the witness 3 has to be replaced with the least 3' so 
that A (3') is true. 

The challenge was to extend this procedure to (a) cover more than one 
8-term in the proof, (b) take care of nested 8-terms, and lastly (c) extend it 
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to second-order e's and terms involving them, i.e, 8/21^ (/(a)). This is what 
Ackermann set out to do in the last part of his dissertation, and what he and 
Hilbert thought he had accomplished.^^ 

The system for which Ackermann attempted to give a consistency proof 
consisted of the system of second-order primitive recursive arithemtic (see 
Section 3.1 above) together with the transfinite axioms: 

1. A{a)^A{eaA{a)) A„/(a) ^ A„((S/Afe(/(^7)))(a)) 

2. A{eaA{a)) ^naA{a)=0 Aa{efAh{f{b)){a)) ^nfAa{f{a))=0 

3. AiEaAia)) ^ KgAia) = 1 A„(8/Afe(/(Z7))(a)) ^ 7l/A„(/(a)) = 1 

4. eaA(a)/0^A(8(e«A(a)))43 

The intuitive interpretation of 8 and 71, based on these axioms, is obvious: 
£a2l(a) is a witness for 2l(fl) if one exists, and 7ia2l(a) = 1 if 21(a) is false 
for all a, and = otherwise. The Ji functions are not necessary for the de- 
velopment of mathematics in the axiom system. They do, however, serve a 
function in the consistency proof, viz., to keep track of whether a value of 
for ea2t(a) is a "default value" (i.e., a trial substitution for which 21(a) may or 
may not be true) or an actual witness (a value for which 21(a) has been found 
to be true). 

I shall now attempt to give an outline of the £-substitution procedure de- 
fined by Ackermann. For simplicity, I will leave the case of second-order 
e-terms (i.e., those involving e/) to the side. 

An e-term is an expression of the form ea2l(a), where a is the only free 
variable in 21, and similarly for a Jt-term. For the purposes of the discussion 
below, we will not specifically refer to Ti's unless necessary, and most defini- 
tions and operations apply equally to 8-terms and 7i-terms. If a formula A (a) 
or an e-term £a2l(a) contains no variable-free subterms which are not numer- 
als, we call them canonical. Canonical formulas and e-terms are indicated by 
a tilde: ea2l(a). 

The main notion in Ackermann's proof is that of a total substitution S 
(Gesamtersetzung). It is a mapping of canonical e- and 7l-terms to numerals 
and or 1, respectively. When canonical e-terms in a proof are successively 
replaced by their values under the mapping, a total substitution reduces the 
proof to one not containing any e's. If S maps ea2l(a) to 3 and 7ia2t(a) to /, then 
we say that 2i(a) receives a (3,i) substitution under S and write 5(2l(a)) = 
(3,i). 

It is of course not enough to define a mapping from the canonical e-terms 
occuring in the proof to numerals: The proof may contain, e.g., efl2l(a, cp(e/, 53(^7))). 
To reduce this to a numeral, we first need a value 3 for the term ei,^{b). 
Replacing e},^{b) by 3, we obtain ta^{a,<p{^)). Suppose the value 9(3) is 3'. 
The total substitution then also has to specify a substitution for ea2l(a,3'). 
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Given a total substitution S, a proof is reduced to an 8-free proof as fol- 
lows: First all 8-free terms are evaluated. (Such terms contain only numerals 
and primitive recursive functions; these are computed and the term replaced 
by the numeral corresponding to the value of the term) Now let en, en, 
... be all the innermost (canonical) s- or 7t-terms in the proof, i.e., all 8- 
or Ji-terms which do not themselves contain nested 8- or 7l-terms or con- 
stant (variable-fee) sub terms which are not numerals. The total substitution 
specifies a numeral substitution for each of these. Replace each ci, by its 
corresponding numeral. Repeat the procedure until the only remaining terms 
are numerals. We write |e|5 for the result of applying this procedure to the 
expression (formula or term) e. Note that |e|5 is canonical. 

Based on this reduction procedure, Ackermann defines a notion of subor- 
dination of canonical formulas. Roughly, a formula ^{b) is subordinate to 
2l(fl) if in the process of reducing some formula 21(3), an 8-term ei,^{b) is 
replaced by a numeral. For instance, a = b is subordinate to a = eb{a = b). 
Indeed, if 21(a) is a = eb{a = b), then the reduction of 21(3) = 3 = 8^,(3 = Z?) 
would use a replacement for the 8-term belonging to !B(3 = b).'^'^ It is easy to 
see that this definition corresponds to the notion of subordination as defined 
in Hilbert and Bernays (1939). An e-expression is an expression of the form 
8a2l(a). If Sa2l(a) contains no free variables, it is called an e-term. If an 8- 
term 8/,!B(Zj) occurs in an expression (and is different from it), it is said to be 
nested in it. If an 8-expression 8^,03 (a, Z?) occurs in an expression in the scope 
of 8a, then it is subordinate to that expression. Accordingly, we can define 
the degree of an 8-term and the rank of an 8-expression as follows: An 8-term 
with no nested 8-subterms is of degree 1 ; otherwise its degree is the maximum 
of the degrees of its nested 8-subterms +1. The rank of an 8-expression with 
no subordinate 8-expressions is 1; otherwise it is the maximum of the ranks 
of its subordinate 8-expressions +1. If %{h) is subordinate to %{a) according 
to Ackermann's definition, then Zi,^{b) is subordinate in the usual sense to 
8a2l(a), and the rank of th'^{b) is less than that of 8a2i(a). The notion of de- 
gree corresponds to an ordering of canonical formulas used for the reduction 
according to a total substitution in Ackermann's procedural definition: First 
all 8-terms of degree 1 (i.e., all innermost 8-terms) are replaced, resulting 
(after evaluation of primitive recursive functions) in a partially reduced proof. 
The formulas corresponding to innermost 8-terms now are reducts of 8-terms 
of degree 2 in the original proof. The canonical formulas corresponding to 
8-terms of degree 1 are called the formulas of level 1, the canonical formulas 
corresponing to the innermost 8-terms in the results of the first reduction step 
are the formulas of level 2, and so forth. 

The consistency proof proceeds by constructing a sequence 5i, 52, . . . of 
total substitutions together with bookkeeping functions /, (2l(fl), 7) {0, 1},"^'' 
which eventually results in a solving substitution, i.e., a total substitution 
which reduces the proof to one which contains only correct 8-free formulas. 
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We begin with a total substitution which assigns (0,1) to all canonical 
formulas, and set f\ (2t(a), 1) = 1 for all 2t(a) for which 5i assigns a value. If 
Si is a solving substitution, the procedure terminates. Otherwise, the next total 
substitution is obtained as follows: If 5, is not a solving substitution, at 
least one of the critical formulas in the proof reduces to an incorrect formula. 
We have three cases: 

1. Either an 8-axiom 2t(a) — > 2l(8a2l(a)) or a Jl-axiom of the first kind 
2l(8a2t(a)) 7ia2l(a) = reduces to a false formula of the form 21(3) ~^ 
a(0) or 21(0) ^1=0, and 5;(2i(a)) = (0,1). Pick one such 2i(a) of 
lowest level (i.e., £^21(0') of lowest degree). 

If 21(0) ^ 1 = is incorrect, 21(0) must be correct; let Si+i{A{a)) = 
(0,0). Otherwise 21(3) 21(0) is incorrect and hence 21(3) must be cor- 
rect; then let 5,+i(2i(a)) = (3,0). In either case, set /;+i(2t(a), j+ 1) = 
1. 

For other formulas ^(b), Si^\ {^{b)) = Sj{^{b)) where j is the greatest 
index < / such that fil^{b),j) = 1. Si+i{^{b)) = (0, 1) if no such ; ex- 
ists (i.e., ^{b) has never before received an example substitution). Also, 
let /,+ 1 (*B (Zj) , / + 1 ) = 1 . For all canonical formulas £(c), let fi+i (€(c) , j) = 
fi{l{c)J)iorj<i. 

2. Case (1) does not apply, but at least one of the minimaUty axioms Sa2t(a) 7^ 
2l(5(£a2l(fl)) reduces to a false formula, 3 7^ ^ 21(3 ~ 1)- This is 
only possible if 5,(21(0)) = (3,0). Again, pick the one of lowest level, 
and let (21(a)) = (3 - 1,0) and /;+i(2i(a), j -|- 1) = 1. Substitutions 
for other formulas and bookkeeping functions are defined as in case (1). 

3. Neither case (1) nor (2) appUes, but some instance of an e-axiom of 
the form 21(a) 2l(8a2t(a)) or of a 7l-axiom of the form 2t(8a2t(a)) — 

7la2l(a) = 1, e.g., 21(a) 21(3)) or 21(3) ^0=1, reduces to an incorrect 
formula. We then have Si(pi{a)) = (3,0) (since otherwise case (1) would 
apply). In either case, |2t(3)) must be incorrect. Let j be the least index 
where 5y(2l(a)) = (3,0) and fj{^{a))J) = 1. At the preceding total sub- 
stitution 5^-_i, 5X2t(fl)) = (0, 1) or5y_i(A(fl)) = (3 + 1,0), and |2l(3)|5^._, 
is correct. 21(3) thus must reduce to different formulas under 5^-1 and 
under 5,-, which is only possible if a formula subordinate to 21 reduces 
differently under Sj-i and 5,-. 

For example, suppose 2t(a) is really ^{a,tb%{a,b)). Then the corre- 
sponding 8-axiom would be 

2i(a,8fo<B(a,^7)) ^ 2i(8«2i(a,8f,<B(a,^7)),8f,«8(8«2t(a,8fe<B(a,^7)),&)) 
V ' V ' 

e„2l(a) 8„2l(a) 
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An instance thereof would be 

^{a,tb^{a,b) ^ ^{ta^{a,Zh^{a,b)),eh^{ta&^{a,eb^{a,b)),b)). 

This formula, under a total substitution with 5,(2i(a, 8^*8(^,^7))) = (3,0) 
reduces to 

^{a,eb^ia,b) ^^ii,Et^{i,b)) 

The consequent of this conditional, i.e., 21(3), can reduce to different for- 
mulas under 5; and 5y_i only if 8^,^(3,^7) receives different substitutions 
under 5/ and 5^-1, and ^{a,b) is subordinate to 21(a). 

The next substitution is now defined as follows: Pick an innermost for- 
mula subordinate to 2t(a) which changes substitutions, say ^B(^). For 
all formulas C(c) which are subordinate to '^{b) as well as ^{b) itself, 
we set fi+i{l{c),k) = 1 for j <k<i+\ and /,+i(e(c),A:) = for all 
other formulas. For k < j we set /i+i (C(c),«) = fi{^{c),k) for all C(c). 
The next substitution is now given by S'i+i (C(c)) = Sk{€{c)) for k 
greatest such that fi+i (£(c),A:) = 1 or = (0, 1) if no such k exists. 

Readers familiar with the substitution method defined in Ackermann (1940) 
will note the following differences: 

a. Ackermann (1940) uses the notion of a type of an 8-term and instead of 
defining total substitutions in terms of numeral substitutions for canonical 
8-terms, assigns a function of finite support to 8-types. This change is 
merely a notational convenience, as these functional substitutions can 
be recovered from the numeral substitutions for canonical 8-terms. For 
example, if S assigns the substitutions to the canonical terms on the left, 
then a total substitution in the sense of Ackermann (1940) would assign 
the function g on the right to the type £a2l(a,Zj): 

5(2l(a,3i)=3; ^(8,2t(a,^))(3i)=3; 

5(2l(a,33)=32 g(8«2l(a,Z7))(32)=32 
5(2l(a,32)=33 g(8«2l(a,^7))(33)=3^ 

b. In case (2), deahng with the least number (induction) axiom, the next sub- 
stitution is defined by reducing the substituted numeral 3 by 1, whereas 
in (Ackermann, 1940), we immediately proceed to the least 3' such that 
21(3') is correct. This makes the procedure converge more slowly, but also 
suggests that in certain cases (depending on which other critical formulas 
occur in the proof), the solving substitution does not necessarily provide 
example substitutions which are, in fact, least witnesses. 

c. The main difference in the method lies in case (3). Whereas in (1940), ex- 
ample substitutions for all 8-types of rank lower than that of the changed 
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8a2t(a) are retained, and all others are reset to initial substitutions (func- 
tions constant equal to 0), in (1924b), only the substitutions of some 
£-terms actually subordinate to Sa2l(a) are retained, while others are not 
reset to initial substitutions, but to substitutions defined at some previous 
stage. 

3.5. Assessment and complications 

A detailed analysis of the method and of the termination proof given in the last 
part of Ackermann's dissertation must wait for another occasion, if only for 
lack of space. A preliminary assessment can, however, already be made on the 
basis of the outline of the substitution process above. Modulo some needed 
clarification in the definitions, the process is well-defined and terminates at 
least for proofs containing only least-number axioms (critical formulas cor- 
responding to axiom (4)) of rank 1. The proof that the procedure terminates 
(§9 of Ackermann (1924b)) is opaque, especially in comparison to the proof 
by transfinite induction for primitive recursive arithmetic. The definition of a 
substitution method for second-order e-terms is insufficient, and in hindsight 
it is clear that a correct termination proof for this part could not have been 
given with the methods available."*^ 

Leaving aside, for the time being, the issue of what was actually proved 
in Ackermann (1924b), the question remains of what was believed to have 
been proved at the time. The system, as given in (1924b), had two major 
shortcomings: A footnote, added in proof, states: 

[The formation of e-terms] is restricted in that a function variable f{a) 
may not be substituted by a functional a{a), in which a occurs in the 
scope of an 8/.^' 
This appUes in particular to the second-order e-axioms 

A,{f{a))^Aa{efAh{m){a)). 

If we view £fAi,{f{b)){a) as the function "defined by" A, and hence the 8- 
axiom as the £-calculus analog of the comprehension axiom, this amounts 
roughly to a restriction to arithmetic comprehension, and thus a predicative 
system. This shortcoming, and the fact that the restriction turns the system 
into a system of predicative mathematics was pointed out by von Neumann 
(1927). 

A second lacunae was the omission of an axiom of 8-extensionality for 
second-order 8-terms, i.e., 

mm) ^ B{f )) ^ 8yA(/) = £yB(/), 

which corresponds to the axiom of choice. Both problems were the subject 
of correspondence with Bemays in 1925.'*^ A year later, Ackermann is still 
trying to extend and correct the proof, now using 8-types: 
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I am currently working again on the 8/-proof and am pushing hard to 
finish it. I have already told you that the problem can be reduced to 
one of number theory. To prove the number-theoretic theorem seems to 
me, however, equally hard as the problem itself. I am now again taking 
the approach, which I have tried several times previously, to extend the 
definition of a ground type so that even 8 with free function variables 
receive a substitution. This approach seems to me the most natural, and 
the equahty axioms (/) (A (/) ^ S (/) ) ^ 8/A/ = 8/S/ would be treated 
simultaneously. I am hopeful that the obstacles previously encountered 
with this method can be avoided, if 1 use the 8^ formalism and use sub- 
stitutions for the 8/ which may contain 8^ instead of functions defined 
without 8. 1 have, however, only thought through some simple special 

49 

cases. 

In 1927, Ackermann developed a second proof of 8-substitution, using some 
of von Neumann's ideas (in particular, the notion of an 8-type, Grundtyp). 
The proof is unfortunately not preserved in its entirety, but references to it 
can be found in the correspondence. On April 12, 1927, Bemays writes to 
Ackermann: 

Finally 1 have thought through your newer proof for consistency of the 
8a 's based on what you have written down for me befor your departure, 
and beUeve that I have seen the proof to be correct.^^ 
Ackermann also refers to the proof in a letter to Hilbert from 1933: 

As you may recall, 1 had at the time a second proof for the consis- 
tency of the 8a 's. 1 never published that proof, but communicated it to 
Prof. Bemays orally, who then verified it. Prof. Bemays wrote to me 
last year that the result does not seem to harmonize with the work of 
Godel.5i 

In Hubert's address to the Intemational Congress of Mathematicians in 1928 
(Hilbert, 1928), the success of Ackermann's and von Neumann's work on e- 
substitution for first-order systems is also taken for granted. Although Hilbert 
poses the extension of the proof to second-order systems as an open problem, 
there seems no doubt in his mind that the solution is just around the corner.^^ 
It might be worthwhile to mention at this point that at roughly the same 
time a third attempt to find a satisfactory consistency proof was made. This at- 
tempt was based not on 8-substituion, but on Hilbert's so-called unsuccessful 
proof (yerungluckter Beweis). 

While working on the Grundlagenbuch, I found myself motivated to re- 
think Hilbert's second consistency proof for the 8-axiom, the so-called 
"unsuccessful" proof, and it now seems to me that it can be fixed after 

This proof bears a striking resemblance to the proof the first e-theorem in 
(Hilbert and Bemays, 1939) and to a seven-page sketch in Bemays's hand of a 
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"consistency proof for the logical axiom of choice" found bound with lecture 
notes to Hubert's course on "Elements and principles of mathematics" of 
1910.^^ This "unsuccessful" proof seems to me to be another but independent 
contribution to the development of logic and the e-calculus, independent of 
the substituion method. Note that Bemays's proof of Herbrand's theorem in 
(Hilbert and Bernays, 1939) is based on the (second) 8-theorem is the first 
correct published proof of that important result. 

The realization that the consistency proof even for first-order e's was prob- 
lematic came only with Godel's incompleteness results. In a letter dated March 10, 
1931, von Neumann presents an example that shows that in the most recent 
version of Ackermann's proof, the length of the substitution process not only 
depends on the rank and degree of 8-terms occurring in the proof, but also on 
numerical values used as substitutions. He concludes: 

I think that this answers the question, which we recently discussed when 
going through Ackermann's modified proof, namely whether an estimate 
of the length of the correction process can be made uniformly and inde- 
pendently of numerical substituends, in the negative. At this point the 
proof of termination of the procedure (for the next higher degree, i.e., 3) 
has a gap.^^ 

There is no doubt that the discussion of the consistency proof was precip- 
itated by Godel's results, as both von Neumann and Bernays were aware of 
these results, and at least von Neumann realized the implications for Hilbert's 
Program and the prospects of a finitistic consistency proof for arithmetic. 
Bernays corresponded with Godel on the relevance of Godel's result for the 
viability of the project of consistency proofs just before and after von Neu- 
mann's counterexample located the difficulty in Ackermann's proof. On Jan- 
uary 18, 1931, Bernays writes to Godel: 

If one, as does von Neumann, assumes as certain that any finite consid- 
eration can be formulated in the framework of System P — ^I think, as you 
do too, that this is not at all obvious — one arrives at the conclusion that 
a finite proof of consistency of P is impossible. 

The puzzle, however, remained unresolved for Bernays even after von Neu- 
mann's example, as he writes to Godel just after the exchange with von 
Neumann, on April 20, 1931: 

The confusion here is probably connected to that about Ackermann's 
proof for the consistency of number theory (System 3), which I have not 
so far been able to clarify. 

That proof — on which Hilbert has reported in his Hamburg talk on 
the "foundations of mathematics"^^' [. . . ] — I have repeatedly thought 
through and found correct. On the basis of your results one must now 
conclude that this proof caimot be formaUzed within System 3; indeed, 
this must hold even if one restricts the system whose consistency is 
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to be proved by leaving only addition and multiplication as recursive 
definitions. On the other hand, I do not see which part of Ackermann's 
proof makes the formalization within 3 impossible, in particular if the 
problem is so restricted.^' 

Godel's results thus led Bemays, and later Ackermann to reexamine the meth- 
ods used in the consistency proofs. A completion of the project had to wait 
until 1940, when Ackermann was able to carry through the termination proof 
based on transfinite induction — following Gentzen (1936) — on eo. 



4. Conclusion 

With the preceding exposition and analysis of the development of axiomatiza- 
tions of logic and mathematics and of Hilbert and Ackermann's consistency 
proofs I hope to have answered some open questions regarding the historical 
development of Hilbert's Program. Hilbert's technical project and its evolu- 
tion is without doubt of tremendous importance to the history of logic and the 
foundations of mathematics in the 20th century. Moreover, an understanding 
of the technical developments can help to inform an understanding of the 
history and prospects of the philosophical project. The lessons drawn in the 
discussion, in particular, of Ackermann's use of transfinite induction, raise 
more questions. The fact that transfinite induction in the form used by Acker- 
mann was so readily accepted as finitist, not just by Ackermann himself, but 
also by Hilbert and Bemays leaves open two possibilities: either they were 
simply wrong in taking the finitistic nature of Ackermann's proof for granted 
and the use of transfinite induction simply cannot be reconciled with the fini- 
tist standpoint as characterized by Hilbert and Bemays in other writings, or 
the common view of what Hilbert thought the finitist standpoint to consist in 
must be revised. Specifically, it seems that the explanation of why transfinite 
induction is acceptable stresses one aspect of finitism while downplaying 
another: the objects of finitist reasoning are — finite and — intuitively given, 
whereas the methods of proof were not required to have the epistemic strength 
that the finitist standpoint is usually thought to require (i.e., to guarantee, in 
one sense or another, the intuitive evidence of the resulting theorems). Of 
course, the question of whether Hilbert can make good on his claims that 
finitistic reasoning affords this intutive evidence of its theorems is one of the 
main difficulties in a philosphical assessment of the project (see, e.g.. Parsons 
(1998)). 

I have already hinted at the implications of a study of the practice of 
finitism for philosophical reconstructions of the finitist view (in note 4). We 
are of course free to latch on to this or that aspect of Hilbert's ideas (finitude, 
intuitive evidence, or surveyabiUty) and develop a philosophical view around 
it. Such an approach can be very fmitful, and have important and inightful 
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results (as, e.g, the example of Tait's (1981) work shows). The question is to 
what extent such a view should be accepted as a reconstruction of Hilbert's 
view as long as it makes the practice of the technical project come out off 
base. Surely rational reconstruction is governed by something like a principle 
of charity. Hilbert and his students, to the extent possible, should be construed 
so that what they preached is reflected in their practice. This requires, of 
course, that we know what the practice was. If nothing else, I hope to have 
provided some of the necessary data for that. 
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Notes 

^ Hilbert (1905, 131). For a general discussion of Hilbert's views around 1905, see Peck- 
haus (1990, Chapter 3). 

^ „Die von zermelo benutze axiomatische Methode ist zwar unanfechtbar und unentbehrUch. 
Es bleibt dabei doch die Frage offen, ob die aufgestellten Axiome nicht etwa einen Widerspruch 
einschliessen. Ferner erhebt sich die Frage, ob und inwieweit sich das Axiomensystem aus der 
Logik ableiten lasst. [. . . D]er Versuch einer Zuriickfiihrung auf die Logik scheint besonders 
deshalb aussichtsvoU, weil zwischen Mengen, welche ja die Gegenstande in zermelos Axiomatik 
bilden, und den Pradikaten der Logik ein enger Zusammenhang besteht. Namlich die Mengen 
lassen sich auf Pradikate zuriickfuhren. 

Diesen Gedanken haben FREGE, RUSSEL[L] und WEYL zum Ausgangspunkt genommen 
bei ihren Untersuchungen iiber die Grundlagen der Mathematik." (Hilbert 1920, 27-28). 

^ „Wir mussen uns namlich fragen, was es bedeuten soU: „es gibt ein Pradikat P." In der 
axiomatischen Mengentheorie bezieht sich das „es gibt" immer auf den zugrunde gelegten 
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Bereich 33 . In der Logik konnen wir zwar auch die Pradikate zu einem Bereich zusammengefasst 
denken; aber dieser Bereich der Pradikate kann hier nicht als etwas von vorneherein Gegebenes 
betrachtet werden, sondern die Pradikate miissen durch logische Operationen gebildet werden, 
und durch die Regeln der Konstruktion bestiimnt sich erst nachtraghch der Pradikaten-Bereich. 

Hiernach ist ersichtlich, dass bei den Regeln der logischen Konstruktion von Pradikaten 
die Bezugnahme auf den Pradikaten-Bereich nicht zugelassen werden kann. Denn sonst ergabe 
sich ein circulus vitiosus." (Hilbert 1920, 31). 

^ "RUSSELL geht von dem Gedanken aus, dass es geniigt, das zur Definition der Vereinigungsmenge 
unbrauchbare Pradikat durch ein sachlich gleichbedeutendes zu ersetzen, welches nicht dem 
gleichen Einwande unterliegt. Allerdings vermag er ein solches Pradikat nicht anzugeben, aber 
er sieht es als ausgemacht an, dass ein solches existiert. In diesem Sinne stellt er sein „Axiom 
der Reduzierbarkeit" auf, welches ungefahr folgendes besagt: „Zu jedem Pradikat, welches 
durch (ein- oder mehrmalige) Bezugnahme auf den Pradikatenbereich gebildet ist, gibt es ein 
sachlich gleichbedeutendes Pradikat, welches keine solche Bezugnahme aufweist. 

Hiermit kehrt aber russell von der konstniktiven Logik zu dem axiomatischen Standpunkt 
zuriick. [. . . ] 

Das Ziel, die Mengenlehre und damit die gebrauchlichen Methoden der Analysis auf die 
Logik zurtickzuftihren, ist heute nicht erreicht und ist vielleicht iiberhaupt nicht erreichbar." 

(Hilbert 1920, 32-33). 

^ ". . . zur Vermeidung von Paradoxien ist daher eine teilweise gleichzeitige Entwicklung 
der Gesetze der Logik und der Arithmetik erforderlich." (Hilbert 1905, 176). 

* "In Anbetracht der grossen Mannigfaltigkeit von Verkniipfungen und Zusammenhangen, 
welche die Arithmetik aufweist, ist es von vornherein ersichtlich, dass wir die Aufgabe des 
Nachweises der Widerspruchslosigkeit nicht mit einem Schlage losen konnen. Wir werden 
vielmehr so vorgehen, dass wir zunachst nur die einfachsten Verkniipfimgen betrachten vmd 
dann schrittweise immer hohere Operationen und Schlussweisen hinzunehmen, wobei dann 
fiir jede Erweiterung des Systems der Zeichen und der Uebergangsformeln einzeln der Nachweis 
zu erbringen ist, dass sie die auf der vorherigen Stufe festgestellte Widerspruchsfreiheit nicht 
aufheben. 

Ein weiterer wesentlicher Gesichtspunkt ist, dass wir, gemass vmserem Plan der restlosen 

Formalisiemng der Arithmetik, den eigentlich mathematischen Formalismus im Zusammenhang 
mit dem Formalismus der logischen Operationen entwickeln miissen, sodass — wie ich es 
ausgedriickt habe — ein simultaner Aufbau von Mathematik und Logik ausgefiihrt wird.", (Hilbert 1922b, 
8a-9a). The passage is not contained in Kneser's notes (Hilbert and Bemays, 1923a) to the 
same course. 

^ The notes by Kneser (Hilbert and Bernays, 1923a) do not contain the list of systems 
below. The version of the e-calculus used in the addendum is the same as that used in Kneser's 
notes, and differs from the presentation in Ackermann (1924b), submitted February 20, 1924. 

^ (Hilbert and Bemays, 1923b), 17, 19 

' "Disposition. Stufe II war elementares Rechnen Axiome 1-16 

Stufe III. Nun elementare Zahlentheorie 
Schema fiir Def. von Funktionen durch Rekursion u. Schlussschema 
woUen [?] unser Schlussschema noch das Induktionsschema hinzuziehen 
Wenn auch inhaltlich das wesentlich mit den Ergebnissen der anschauliche gewonnenen [?] 
Zahlenth. iibereinstimmt, so doch jetzt Formeln z.B. a + h = h + a. 

Stufe nil. Transfinite Schlussweise u. teilweise Analysis 

Stufe V. Hiihere Variablen-Gattimgen u. Mengenlehre. Auswahlaxiom 

Stufe VI. Zahlen d[er] 2^^" Zahlkl[asse], Voile transfin[ite] Induktion. Hohere Typen. 
Continuumsproblem, transfin[ite] Induktion fiir Zahlen der 2'^"* ZahUd. 
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Stufe VII. 1.) Ersetzung der oo vielen Definitionsschemata durch ein Axiom. 2.) Analysis 
u[nd] Mengenlehre. Auf der 4*'^" Stufe nochmals der voile Satz von der oberen Grenze 

Stufe Vin. Formalisierung der Wohlordnung. " (Hilbert and Bemays, 1923a), Ergdnzung, 
sheet 1. 

1" The proof can also be found in Hilbert (1922c, 171-173); cf. Mancosu (1998, 208-210). 
' ' "Somit sehen wir uns veranlasst, die Beweise als solche zum Gegenstand der Untersuchung 
zu machen; wir werden zu einer Art von Beweistheorie gedrangt, welche mit den Beweisen 
selbst als Gegenstanden operiert. 

Fiir die Denkweise der gewohnlichen Zahlentheorie sind die Zahlen das gegenstandlich- 
Aufweisbare, und die Beweise der Satze iiber die Zahlen fallen schon in das gedankliche 
Gebiet. Bei unserer Untersuchung ist der Beweis sebst etwas Aufweisbares, und durch das 
Denken iiber den Beweis kommen wir zur Losung unseres Problems. 

Wie der Physiker seinen Apparat, der Astronom seinen Standort untersuch, wic der Philosoph 
Vernunft-Kritik iibt, so braucht der Mathematiker diese Beweistheorie, um jeden mathematischen 
Satz durch eine Beweis-Kritik sicherstellen zu konnen." Hilbert (1920, 39^0). Almost the 
same passage is found in Hilbert (1922c, 169-170), cf. Mancosu (1998, 208). 

For a detailed discussion of these influences, see Mancosu (1999a). 
'■^ (Hilbert 1922b, part 2, 3). Kneser's Mitschrift of these lectures contains a different system 
which does not include negation. Instead, numerical inequality is a primitive. This system is 
also found in Hilbert's first talks on the subject in Copenhagen and Hamburg in Spring and 
Summer of 1921. Hilbert (1923), a talk given in September 1922, and Kneser's notes to the 
course of Winter Semester 1922-23 (Hilbert and Bernays, 1923b) do contain the new system 
with negation. This suggests that the developments of Hilbert's 1921-22 lectures were not 
incorporated into the published version of Hilbert's Hamburg talk (1922c). Although (1922c) 
was published in 1922, and a footnote to the title says 'This communication is essentially the 
content of the talks which I have given in the Spring of this year in Copenhagen [. . . ] and 
in the Summer in Hamburg [. . . ]," it is clear that the year in question is 1921, when Hilbert 
addressed the Mathematisches Seminar of the University of Hamburg, July 25-27, 1921. A 
report of the talks was published by Reidemeister in Jahrbuch der Deutschen Mathematiker- 
Vereinigung 30, 2. Abt. (1921), 106. Hilbert and Bemays (1923b) also have separate axioms 
for conjunction and disjunction, while in (1923) it is extended it by quantifiers. 

^'^ The procedure whereby we pass from 21 to 21' is simple in this case, provided we keep 
track of which variables are substituted for below the inference. In general, the problem of 
deciding whether a formula is a substitution instance of another, and to calculate the substitu- 
tion which would make the latter syntactically identical to the former is known as matching. 
Although not computationally difficult, it is not entirely trivial either. 

"Nennen wir eine Formel, in der keine Variablen und keine Funktionale ausser Zahlzeichen 
vorkommen, eine „explizite [numerische] Former, so konnen wir das gefundene Ergebnis so 
aussprechen: Jede beweisbare explizite [numerische] Formel ist Endformel eines Beweises, 
dessen samtliche Formeln explizite Formeln sind. 

Dieses miisste insbesondere von der Formel 7^ gelten, wenn sie beweisbar ware. Der 
verlangte Nachweis der Widerspruchsfreiheit ist daher erbracht, wenn wir zeigen, dass es 
keinen Beweis der Formel geben kann, der aus lauter expliziten Formeln besteht. 

Um diese Unmoglichkeit einzusehen, geniigt es, eine konkret feststellbare Eigenschaft zu 
finden, die erstens alien den expliziten Formeln zukommt, welche durch Einsetzung aus einem 
Axiom entstehen, die ferner bei einem Schluss sich von den Pramissen auf die Endformel 
iibertragt, die dagegen nicht auf die Formel 0^0 zutrifft." (Hilbert 1922b, part 2, 27-28). 

16 "Wir teilen die expliziten Formeln in „richtige" und ,falsche" ein. Die expliziten Primformeln 
sind Gleichungcn, auf deren beiden Seiten Zahlzeichen stchcn. Eine solche Gleichimg nennen 
wir richtig, wenn die beiderseits stehenden Zahlzeichen iibereinstimmen; andernfalls nennen 
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wir sie falsch. Eine Ungleichung, auf deren beiden Seiten Zahlzeichen stehen, nennen wir 
richtig, falls die beiden Zahlzeichen verschieden sind; sonst nen[n]en wir sie, falsch. 

In der Normalform einer beliebigen expUziten Formel haben alle Disjunktionsglieder die 
Gestalt von Gleichungen oder Ungleichungen, auf deren beiden Sieten Zahlzeichen stehen. 

Wir nennen nun eine allgemeine explizite Formel richtig, wenn in der zugehorigen Normalform 
jede als Konjunktionsglied auftretende (bezw. die ganze Normalform ausmachende) Disjunktion 
eine richtige Gleichung oder eine richtige Ungleichung als GUed enthalt. Andemfalls nennen 
wir die Formel /a/jc/z. [. . . ] 

Nach der gegebenen Definition lasst sich die Frage, ob eine expUzierte [sic] Formel richtig 
oder falsch ist, in jed[e]m Falle konkret entscheiden. Hier gilt also das „tertium non datui" 
[. . . ]" (Hilbert 1922b, part 2, 33). 
" Hilbert (1918b, 149-150). See also Zach (1999, §2.3). 

A sketch of the consistency proof is found in the Kneser Mitschrift to the 1921-22 lectures 
(Hilbert, 1922a) in Heft II, pp. 23-32 and in the official notes by Bernays (Hilbert 1922b, 
part 2, 19-38). The earlier Kneser Mitschrift leaves out step (1), and instead of eliminat- 
ing variables introduces the notion of einsetzungsrichtig (correctness under substitution, i.e., 
every substitution instance is correct). These problems were avoided in the official Bernays 
typescript. The Kneser notes did contain a discussion of recursive definition and induction, 
which is not included in the official notes; more about these in the next section. 

In the 1921-22 lectures, it is initially argued that the result of applying transformations 
(l)-(3) results in a proof of the same end formula (if substitutions are added to the initial 
formulas). Specifically, it is suggested that the result of applying elimination of variables and 
reduction of functionals to the axioms results in formulas which are substitution instances of 
axioms. It was quickly realized that this is not the case. (When Bernays presented the proof in 
the 1922-23 lectures on December 14, 1922, he comments that the result of the transformation 
need not be a proof (Hilbert and Bernays 1923b, 21). The problem is the axiom of equality 

a^h^{A(a) -^A{h)). 

Taking A(c) to be 8(c) = c, a substitution instance would be 

O-f H- 1 = -> (5(0+1 + 1) = 0-f- l-f 1 -> 5(0) = 0) 

This reduces to 

0+1 + 1 = 0^(0+1 =0+1 + 1 ^0 = 0) 
which is not a substitution instance of the equality axiom. The consistency proof itself is not 
affected by this, since the resulting formula is still correct (in Hilbert's technical sense of the 
word). The official notes to the 1921-22 lectures contain a 2-page correction in Bemays's 
hand (Hilbert 1922b, part 2, between pp. 26 and 27). 

The induction rule is not used in (Ackermann, 1924b), since he deals with stage III only 
in passing and attempts a consistency proof for all of analysis. There, the induction rule is 
superseded by an e-based induction axiom. For a consistency proof of stage HI alone, an 
induction rule is needed, since an axiom cannot be formulated without quantifiers (or e). 
The induction rule was introduced for stage III in the Kneser notes to the 1921-22 lectures 
(Hilbert, 1922a, Heft II, 32) and the 1922-23 lectures (Hilbert and Bernays, 1923b, 26). It is 
not discussed in the official notes or the publications from the same period (Hilbert, 1922c; 
1923 

^' The general tenor, outlook, and aims of Skolem's work are sufficiently different from 
Hilbert's to suggest there was no influence either way. Skolem states in his concluding remarks 
that he wrote the paper in 1919, after reading Russell and Whitehead's Principia Mathematica. 
However, neither Hilbert nor Bemays's papers contain an offprint or manuscript of Skolem's 
paper, nor correspondence. Skolem is not cited in any of Hilbert's, Bemays's, or Ackermaim's 
papers of the period, although the paper is referenced in (Hilbert and Bemays, 1934). 



38 



■^^ "Uns fehlt noch ganz das Ax[iom] der vollst[andigen] Induktion. Man konnte meinen, es 
ware 

{Z{a) ^ (A(«) ^A(a+ 1))} ^ {A(l) ^ (Z(/.) 

Das ist es nicht; denn man setze a = 1. Die Voraussetzung mu6 fiir alle a gelten. Wir haben 
aber noch gar kein Mittel, das Alle in die Voraussetzung zu bringen. Unser Formalismus reich 
noch nicht hin, das Ind.ax. aufzuschreiben. 

Aber als Schema konnen wir es: Wir erweitem unsere Beweismethoden durch das nebenstehende 
Schema 

Z{a) -> R{a) 

Jetzt ist es verniinftig, zu fragen, ob dies Schema zum Wspruch fiihren kann." (Hilbert 1922a, 
32). Z is the predicate expressing "is a natural number," it disappears from later formulations 
of the schema. 

"Wie ist es bei Rekursionen? (p(3) komme von Entweder 0, dann setzten wir o dafiir 
Oder 9(3 + 1): Beh[auptung]: Das Einsetzen kommt zu einem AbschluB, wenn wir 

zu innerst anfangen." (Hilbert and Bemays 1923a, 29) 

"Nicht endlich (durch Rek[ursion]) definiert ist z.B. 9(a) = wenn es ein b gibt, so 
daB +ab^ +7 Primz[ahl] ist sonst = 1. Aber erst bei solchen Zahlen und Funktionen 
beginnt das eigentliche math[ematische] Intcrcssc, well dort die Losbarkeit in endlich vielen 
Schritten nicht vorauszusehen ist. Wir haben die Uberzeugung, daB solche Fragen wie nach 
dem Wert (p(a) losbar, d.h. daB (p(a) doch endlich deflnierbar ist. Darauf konnen wir aber 
nicht warten: wir miissen solche Definitionen zulassen, sonst wiirden wir den fceien Betrieb 
der Wissenschaft einschranken. Auch den Begriff der Funktionenfunktion brauchen wir." 
(Hilbert 1922a, HeftlH, 1-2). 

A full proof is given by Ackermann (1924b). 

"Als erstes zeigt man, daB man alle Variablen fortschaffen kann, well auch hier nur freie 
Var[iable] vorkommen. Wir suchen die innersten x und a. Unter diesen stehen nur endlich 
definierte Funkt[ionen] (p, (p'. . .Unter diesen konnen einige im Laufe des Beweises flir / in 
die Ax[iome] eingesetzt sein. 1: x((p) = ^ (Z(o ^ (pa = 1) wo a ein Funktional ist. Wenn 
dies nicht benutzt wird, setze ich alle a((p) und x((p) gleich Null. Sonst reduziere ich a und 
(p{a) und sehe, ob Z(a (p(a) = 1 in alien ... wo sie vorkommt, richtig ist. Ist die richtig, so 
setze ich x = a = 0. 1st sic falsch, d.h. is a = 3 9(3 7^ 1, so sctzen wir 1(9) = 1, 01(9) = 3. 
Dabei bleibt der Beweis Beweis. Die an Stelle der Axiome gesetzten Formeln sind richtig. 

Der Gedanke ist: wenn ein Beweis vorliegt, so kann ich aus ihm ein Argument linden fiir 
das <p = 1 ist). So beseitigt man schrittweise die x und a aund Anwendungen von 12 3 4 und 
erhalt einen Beweis von 1 1 aus I-V und richtigen Formeln d.h. aus I-V, 

T{f,b)=0 ^ {Z{a) ^ f(a,b) = 1} 
x{f,b)^Q -> Z{f{a,b)) 

^{f,b)^0 ^ f{a{f,b),b)^l 

r{f,b)^o ^ x{f,b) = r 

(Hilbert, 1922a), Heft IH, 3^. The lecture is dated February 23, 1922. 
"Was fehlt uns? 

1. in logischer Hinsicht. Wir haben nur gehabt den Aussagenkalkiil mit der Erweiterung auf 
freie Variable d.h. solche fiir die beliebige Funktionale eingesetzt werden konnten. Es 
fehlt das Operieren mit „alle" und „es gibt". 

2. Wir haben das Induktionsschema hinzugefilgt, ohne W[iderspruchs]-f[reiheits] Beweis 
und auch nur provisorisch, also in der Absicht, es wegzuschaffen. 
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3. Bisher nur die arithmethischen Axiome genau [?] die sich auf ganze Zahlen beziehen. Und 
die obigen Mangel verhindem uns ja natiirlich die Analysis aufzubauen (Grenzbegriff, 
Irrationalzahl). 

Diese 3 Punkte liefern schon Disposition und Ziele fiir das Folgende. 

Wir wenden uns zu 1.) Es ist ja an sich klar, dass eine Logik otine „alle" — „es gibt" 
Stuckwerk ware, ich erinnere wie gerade in der Anwendung dieser Begriffe, und den sogennanten 
transflniten Schlussweisen die Hauptschwierigkeiten entstanden. Die Frage der Anwendbarkeit 
dieser Begriffe auf oo Gesamtlieiten tiaben wir nocli nictit betiandelt. Nun konnten wir so 
verfaiiren, wie wir es beim Aussagen-Kalkiil gemacht haben: einige, moglichst einfaciie [???] 
als Axiome zu formalisieren, aus denen sich [sic] dann alle tibrigen folgen. Dann miisste der 
W-f Beweis gefiihrt werden — unserem allgem[einen] Programm gemass: irrit unserer Einstellung, 
dass Beweis eine vorliegende Figur ist. Fiir den W-f Beweis grosse Schwierigkeiten wegen 
der gcbundcncn Variabchi. Die ticfcre Untersuchung zcigt abcr, dass der eigentliche Kern der 
Schwierigkeit an einer anderen Stelle liegt, auf die man gewohnlich erst spater Acht giebt und 
die auch in der Litteratur erst spater wahrgenommen worden ist."(Hilbert and Bemays 1923b, 
Erganzung, sheets 3-4). 

"[Dieser Kern liegt] beim Auswahlaxiom von Zermelo. [. . . ] Die Einwande richten sich 
gegen das Auswahlprinzip. Sie miiBten sich aber ebenso gegen „alle" und „es gibt" richten, 
wobei derselbe Grundgedanke zugrunde liegt. 

Wir wollen das Auswahlaxiom erweitem. Jeder Aussage mit einer Variablen A{a) ordnen 
wir ein Ding zu, fiir das die Aussage nur dann gilt, wenn sie allgemein gilt. Also ein Gegenbeispiel, 
wenn es existiert. 

e(A), eine individuelle logische Funktion. [...]£ geniige dem transflniten Axiom: 
(16) A(eA)->Afl 

z.B. Aa heiBe: a ist bestechlich. zA ist Aristides."(Hilbert and Bernays 1923a, 30-31). The 
lecture is dated February 1, 1922, given by Hilbert. The corresponding part of Hilbert's notes 
for that lecture in (Hilbert and Bemays 1923b, Erganzung, sheet 4)) contains page references 
to (Hilbert 1923, 152 and 156, paras. 4-6 and 17-19 of the English translation), and indicates 
the changes made for the lecture, specifically, to replace X by e. 
See section 3.4 on the e-substitution method. 
3" "Wenn wir eine Funktionsvariable haben: 

AtfAf^Af 

(71 fallt fort)? e komme nur mit 21 vor (z.B. /O = 0, //O = 0). Wie werden wir die Funktionsvariablen 
ausschalten? Statt fc setzen wir einfach c. Auf die gebundenen trifft das nicht zu. Fiir diese 
nehmen wir probeweise eine bestimmte Funktion z.B. 5a und fiihren damit die Reduktion 
durch. Dann steht z.B. 218 2l(p. Diese reduziert ist r[ichtig] oder f[alsch]. Im letzten Falle is 
2l(p falsch. Dann setzen wir uberall (p fiir e /2l/. Dann steht 2t(p — » 2t\|/. Das ist sicher rfichtig] 
da 2t(p f[alsch] ist." (Hilbert and Bemays 1923a, 38-39). 

For a more detailed survey of Ackermann's scientific contributions, see (Hermes, 1967). 
A very informative discussion of Ackermann's scientific correspondence can be found in 
(Ackermann, 1983). 

"In seiner Arbeit "Begriindung des „Tertium non datui" mittels der Hilbertschen Theorie 
der Widerspmchsfreiheit hat Ackermann im allgemeinsten Falle gezeigt, dass der Gebrauch 
der Worte „alle" und „es gibt", des „Tertium non datur" widerspmchsfrei ist. Der Beweis 
erfolgt unter ausschliesslicher Benutzung primitiver und endlicher Schlussweisen. Es wird 
alles an dem mathematischen Formalismus sozusagen direkt demonstriert. 

Ackermann hat damit unter Ueberwindung erheblicher mathematischer Schwierigkeiten 
ein Problem gelost, das bei den modemene auf eine Neubegriindung der Mathematik gerichteten 
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Bestrebungen an erster Stelle steht." Hilbert-NachlaB, Niedersachsische Staats- und Univer- 
sitatsbibliothek Gottingen, Cod. Ms. Hilbert 458, sheet 6, no date. The three-page letter was 
evidently written in response to a request by the President of the International Education 
Board, dated May 1, 1924. 

"Ich bemerke nur, dass Ackermann meine Vorlesungen iiber die Grundlagen der Math- 
[ematik] in den letzten Semestern gehort hat und augenblicklich einer der besten Herren der 
Theorie ist, die ich hier entwickelt habe." ibid., sheet 2. The draft is dated March 19, 1924, and 
does not mention Russell by name. Sieg (1999), however, quotes a letter from Russell's wife to 
Hilbert dated May 20, 1924, which responds to an inquiry by Hilbert concerning Ackermann's 
stay in Cambridge. Later in the letter, Hilbert expresses his regret that the addressee still has 
not been able to visit Gottingen. Sieg documents Hilbert's effort in the preceding years to 
effect a meeting in Gottingen; it is therefore quite likely that the addressee was Russell. 

^'^ Ackermann only requires that b be bound by the occurrence of 9, but this is not enough 
for his proof. 

According to Ackermann's definition of subordination, this would not be true. A subterm 
of c{b) might contain a bound variable and thus not be a constant subterm, but the variable 
could be bound by a function symbol in t other than the occurrence of (p under consideration. 
See note 4. 
36 Ackermann (1924b, 18) 

3' Tait (1981) argues that finitism coincides with primitive recursive arithmetic, and that 
therefore the Ackermann function is not finitistic. Tait does not present this as a historical 
thesis, and his conceptual analysis remains unaffected by the piece of historical evidence 
presented here. For further evidence (dating however mostly from after 1931) see Zach (1998, 
§5) and Tait's response in (2000). 

38 Ackermann (1924b, 13-14). 

3^ "[Gentzen fragt,] ob Sie der Meinung sind, dass sich die Methode des Endlichkeitsbeweises 
durch transfinite Induktion auf den Wf-Beweis Ihrer Dissertation anwenden lasse. Ich wiirde 
es sehr begriissen, wenn das ginge." Bemays to Ackermann, November 27, 1936, Bemays 
Papers, ETH Zurich LibraryAVHS, Hs 975.100. 

"Mir fallt iibrigens jetzt, wo ich gerade meine Dissertation zur Hand nehme, auf, dass 
dort in ganz ahnlicher Weise mit transfiniten Ordnungszahlen operiert wird wie bei Gentzen." 
Ackermann to Bemays, December 5, 1936, Bemays Papers, ETH Ziirich LibraryAVHS, Hs 
975.101. 

"Ich weiss iibrigens nicht, ob Ihnen bekannt ist (ich hatte das seiner Zeit nicht als Ueberschreitung 
des engeren finiten Standpunktes empfunden), dass in meiner Dissertation transfinite Schlusse 
benutzt werden. (Vgl. z.B. die Bemerkungen letzter Abschnitt Seite 13 und im nachstfolgenden 
Abschnitt meiner Dissertation." Ackermann to Bernays, June 29, 1938, Bernays papers, ETH- 
Ziirich, Hs 975.1 14. The passage Ackermann refers to is the one quoted above. 

The e-substitution method was subsequently refined by von Neumann (1927) and Hilbert 
and Bemays (1939). Ackermann (1940) gives a consistency proof for first-order arithmetic, 
using ideas of Gentzen (1936). See also (Tait, 1965) and (Mints, 1994). Useful introductions to 
the e-substitution method of Ackermann (1940) and to the e-notation in general can be found 
in Moser (2000) and Leisenring (1969), respectively. 

^3 Ackermann (1924b, 8). The 7t-functions were present in the earliest presentations in 
(Hilbert, 1922a) as the T-fimction and also occur in (Hilbert and Bemays, 1923a). They were 
dropped from later presentations. 

It is not clear whether the definition is supposed to apply to the formulas with free 
variables (i.e., to a = b and a = ei,{a = b) in the example) or to the corresponding substi- 
tuion instances. The proof following the definition on p. 21 of (Ackermann, 1924b) suggests 
the former, however, later in the procedure for defining a sequence of total substitutions it 
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is suggested that the e-expressions corresponding to formulas subordinate to 2t(a) receive 
substitutions — but according to the definition of a total substitution only e-terms (£^(3 = b) in 
the example) receive substitutions. 

The bookkeeping functions are introduced here and are not used by Ackermann. The 
basic idea is that that in case (3), substitutions for some formulas are discarded, and the next 
substitution is given the "last" total substitution where the substitution for the formula was 
not yet marked as discarded. Instead of explicit bookkeeping, Ackermann uses the notion of a 
formula being "remembered" as having its value not discarded. 

With the restriction on second-order e-terms imposed by Ackermann, and discussed be- 
low, the system for which a consistency proof was claimed is essentially elementary analysis, 
a predicative system. A consistency proof using the e- substitution method for this system was 
given by Mints and Tupailo (1996). 
'^'^ Ackermann (1924b, 9) 

Ackermann to Bemays, June 25, 1925, Bernays Papers, ETH Zurich, Hs. 975.96. 

"Ich habe augenblicklich den e/-Beweis wieder vorgenommen, und versuche mit aller 
Gewalt da zum AbschluB zu kommen. Da6 sich das Problem auf ein zahlentheoretisches 
reduzieren laBt, hatte ich Ihnen damals ja schon mitgeteilt. Den zahlentheoretischen Satz 
allgemein zu beweisen scheint mir aber ebenso schwierig wie das ganze Problem. Ich habe nun 
den schon mehrfach von inir versuchten Weg wieder eingeschlagen, den Begriff des Grundtyps 
so zu erweitem, das auch die e mit freien Funktionsvariablen eine Ersetzung bekoitmien. 
Dieser Weg scheint ja auch der natiirlichste, und die Glcichhcitsaxiome (/)(A(/) ^ B{f)) 
£fAf = ZfBf wiirden dann gleich mitbehandelt. Ich habe einige Hoffnung, daB die sich friiher 
auf diesem Weg einstellenden Schwierigkeiten vermieden werden konnen, wenn ich den Eq- 
Formalismus benutze und statt ohne e definierte Funktionen, solche zur Ersetzung fiir die E/ 
nehme, die ein Eq enthalten konnen. Ich habe mir aber erst einfache Spezialfalle tiberlegt." 
Ackermann to Bernays, March 31, 1926. ETH Zurich/WHS, Hs 975.97. Although Acker- 
mann's mention of "ground types" precedes the publication of von Neumann (1927), the latter 
paper was submitted for publication alread on July 29, 1925. 

"Letzthin habe ich mir Ihren neueren Beweis der Widerspruchsfr[eiheit] fur die E^ an 
Hand dessen, was Sie mir vor Ihrer Abreise aufschrieben, genauer uberlegt und glaube diesen 
Beweis als richtig eingesehen zu haben." Bernays to Ackermann, April 12, 1927, in the poses- 
sion of Hans Richard Ackermann. Bernays continues to remark on specifics of the proof, 
roughly, that when example substitutions for E-types are revised (the situation corresponding 
to case (3) in Ackermann's original proof), the substitutions for types of higher rank have to 
be reset to the initial substitution. He gives an example that shows that if this is not done, the 
procedure does not terminate. He also suggests that it would be more elegant to treat all types 
of the same rank at the same time and gives an improved estimate for the number of steps 
necessary. Note that the reference to "Eq's" (as opposed to Ef) suggest that the proof was only 
for the first-order case. A brief sketch of the proof is also contained in a letter from Bemays 
to Weyl, dated January 5, 1928 (ETH Zurich/WHS, Hs. 91.10a). 

^' "Wie Sie sich vielleicht erinnem, hatte ich damals einen 2. Beweis fiir die Widcrspruchsfreiheit 
der Efl. Dieser Beweis ist von mir nie publiziert worden, sondem nur Herrn Prof. Bernays 
miindlich mitgeteilt worden, der sich auch damals von seiner Richtigkeit ilberzeugte. Prof. 
Bemays schrieb mir nun im vcrgangenen Jahre, daB das Ergebnis ihm mit der Godelschen 
Arbeit nicht zu harmonisieren scheine." Ackermann to Hilbert, August 23, 1933, Hilbert- 
NachlaB, Niedersachsische Staats- und Universitatsbibliothek, Cod. Ms. Hilbert 1. Ackermann 
did not then locate the difficulty, and even a year and a half later (Ackermann to Bemays, 
December 8, 1934, ETH ZiirichAVHS, Hs 975.98) suggested a way that a finitistic consistency 
proof of arithmetic could be foimd based on work of Herbrand and Bemays's drafts for the 
second volume of Grundlagen. 
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^ "Problem I. The consistency proof of the e-axiom for the function variable /. We have 
the outline of a proof. Ackermann has already carried it out to the extent that the only re- 
maining task consists in the proof of an elementary finiteness theorem that is purely arithmeti- 
cal." Hilbert (1928), translated in Mancosu (1999a, 229). The extension to e-extensionality is 
Problem III. 

"Anlasslich der Arbeit ftir das Grundlagenbuch sah ich mich dazu angetrieben, den zweiten 
Hilbertschen Wf.-Beweis fiir das e- Axiom, den sogenannten „verungluckten" Beweis, nochmals 
zu iiberlegen, und es scheint mir jetzt, dass dieser sich doch richtig stellen lasst." Bemays 
to Ackermann, October 16, 1929, in the possession of Hans Richard Ackermann. Bemays 
continues with a detailed exposition of the proof, but concludes that the proof probably cannot 
be extended to include induction, for which e-substitution seems better suited. 

The sketch bears the title "Wf.-Beweis fiir das logische Auswahl-Axiom", and is inserted 
in the front of Elemente und Prinzipienfragen der Mathematik, Sommer-Semester 1910. Li- 
brary of the Mathematisches Institut, Universitat Gottingen, 16.206tl4. A note in Hilbert's 
hand says "Einlage in W.S. 1920." However, the e-Axiom used is the more recent version 
Ab — > ASaAa and not the original, dual AEgAa — > Ab. It is thus very likely that the sketch dates 
from after 1923. 

"Ich glaube, dass damit die Frage, die wir bei der Durchsprechung des modifizierten 
Ackermannschen Beweises zuletzt diskutierten, ob namlich eine Langen-Abschatzung ftir 
das Korrigier-Verfahren unabhangig von der Grosse der Zahlen-Substituenden gleichmassig 
mogUch sei, vemeinend beantwortet ist. An diesem Punkte ist dann der Nachweis des endlichen 
Abbrechens dieses Verfahrens (fiir den nachsten Grad, d.h. 3) jedenfalls liickenhaft.)" von 
Neumann to Bernays, March 10, 1931, Bernays Papers, ETH Zurich/WHS, Hs. 975.3328. 
Von Neumann's example can be found in Hilbert and Bernays (1939, 123). 
56 Hilbert (1928). 

5^ In a letter dated May 3, 1931, Bernays suggests that the problem lies with certain types 
of recursive definitions The Bernays-Godel correspondence will shortly be published in Vol- 
ume IV of Godel's collected works. For more on the reception of Godel's results by Bemays 
and von Neumann, see Dawson (1988) and Mancosu (1999b). 
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